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NOVEL MOLECULES OF THE TANGO -77 RELATED PROTEIN 
FAMILY AND USES THEREOF 

Background of the Invention 
The polypeptide cytokine interleukin-1 (IL-1) 
5 is a critical mediator of inflammatory and overall immune 
response. To date, three members of the IL-1 family, 
IL-lar, IL-1/3 and IL-lra (Interleukin-1 receptor 
antagonist) have been isolated and cloned. IL-lar and 
IL-1/? are proinflammatory cytokines which elicit 
10 biological responses, whereas IL-lra is an antagonist of 
IL-lar and IL-1/3 activity. Two distinct cell-surface 
receptors have been identified for these ligands, the 
type 1 IL-1 receptor (IL-lRtI) and type II IL-1 receptor 
(IL-lRtll) . Recent results suggest that the IL-lRtI is 
is the receptor responsible for transducing a signal and 
producing biological effects. 

As mentioned above, IL-1 is a key mediator of the 
host inflammatory response. While inflammation is an 
important homeostatic mechanism, aberrant inflammation 
20 has the potential for inducing damage to the host. 

Elevated IL-1 levels are known to be associated with a 
number of diseases particularly autoimmune diseases and 
inflammatory disorders. 

Since II -Ira is a naturally occurring inhibitor of 
25 IL-1, IL-lra can be used to limit the aberrant and 

potentially deleterious effects of IL-1. In experimental 
animals, pretreatment with IL-lra has been shown to 
prevent death resulting from lipopolysaccharide- induced 
sepsis. The relative absence of IL-lra has also been 
30 suggested to play a role in human inflammatory bowel 
disease . 

Summary of the Invention 
The present invention is based, at least in part, 
on the discovery of a gene encoding Tango-77, a secreted 



-'SDOCID: <WO g9064?6A1„l_> 



# 

WO 99/06426 



PCT/US98/16102 



- 2 - 

protein that is predicted to be a member of the cytokine 
superfamily. The Tango- 77 cDNA described below (SEQ ID 
NO:l) has three possible open reading frames. The first 
potential open reading frame encompasses 534 nucleotides 
5 extending from nucleotide 3 56 to nucleotide 88 9 of SEQ ID 
NO:l (SEQ ID NO : 3 ) and encodes a 178 amino acid protein 
(SEQ ID NO: 2) . This protein may include a predicted 
signal sequence of about 63 amino acids (from about amino 
acid 1 to about amino acid 63 of SEQ ID NO : 2 (SEQ ID 
10 NO: 4) and a predicted mature protein of about 115 amino 
acids (from about amino acid 64 to amino acid 178 of SEQ 
ID NO: 2 (SEQ ID NO:5)). 

The second potential open reading frame 
encompasses 498 nucleotides extending from nucleotide 389 
15 to nucleotide 889 of SEQ ID NO:l (SEQ ID NO:6) and 

encodes a 167 amino acid protein (SEQ ID NO: 7) . This 
protein may include a predicted signal sequence of about 
52 amino acids (from about amino acid 1 to about amino 
acid 52 of SEQ ID NO:7 (SEQ ID NO:8)) and a predicted 
20 mature protein of about 115 amino acids (from about amino 
acid 52 to amino acid 167 of SEQ ID NO : 7 (SEQ ID NO:9)). 

The third potential open reading frame encompasses 
408 nucleotides extending from nucleotide 481 to 
nucleotide 88 9 of SEQ ID NO:l (SEQ ID NO: 10) and encodes 
25 a 136 amino acid protein (SEQ ID NO: 11) . This protein 
includes a predicted signal sequence of about 21 amino 
acids (from about amino acid 1 to about amino acid 21 of 
SEQ ID NO:ll (SEQ ID NO:12)) and a predicted mature 
protein of about 115 amino acids (from about amino acid 
30 22 to amino acid 136 of SEQ ID NO: 11 (SEQ ID NO: 13)) . 

As used herein, the terms "Tango-77 n , "Tango-77 
protein' 1 , ,! Tango-77 polypeptide" amd the like, can refer 
and polypeptide produced by the cDNA of SEQ ID NO:l 
including any and all of the Tango-77 gene products 
35 described above. 
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Tango- 77 is expected to inhibit inflammation and 
play a functional role similar to that of secreted 
IL-lra. For example, it is expected that Tango-77 may 
bind to the IL-1 receptor, thus blocking receptor 
5 activation by inhibiting the binding of IL-la and IL-1/3 
to the receptor. Alternatively, Tango-77 may inhibit 
inflammation through another pathway, for example, by 
binding to a novel receptor. Accordingly, Tango-77 may 
be useful as a modulating agent in regulating a variety 
10 of cellular processes including acute and chronic 

inflammation, e.g., asthma, chronic myelogenous leukemia, 
rheumatoid arthritis, psoriasis and inflammatory bowel 
disease . 

In one aspect, the invention provides isolated 

is nucleic acid molecules encoding Tango-77 or biologically 
active portions thereof, as well as nucleic acid 
fragments suitable as primers or hybridization probes for 
the detection of Tango-77. 

The invention encompasses methods of diagnosing 

20 and treating patients who are suffering from a disorder 
associated with an abnormal level (undesirably high or 
undesirably low) of inflammation, abnormal activity of 
the IL-1 receptor complex, or abnormal activity of IL-1, 
by administering a compound that modulates the expression 

25 of Tango-77 (at the DNA, mRNA or protein level, e.g., by 
altering mRNA splicing) or by altering the activity of 
Tango-77. Examples of such compounds include small 
molecules, antisense nucleic acid molecules, ribozymes, 
and polypeptides. 

30 The invention features a nucleic acid molecule 

which is at least 45% (e.g., 55%, 65%, 75%, 85%, 95%, or 
98%) identical to the nucleotide sequence shown in SEQ ID 
N0:1, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID NO: 10, the 
nucleotide sequence of the cDNA insert of the plasmid 
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deposited with ATCC as Accession Number (the " cDNA of 
ATCC 98807") # or a complement thereof. 

The invention features a nucleic acid molecule 
which includes a fragment of at least 100 (e.g., 250, 
5 325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, 900, or 989) nucleotides of the nucleotide sequence 
shown in SEQ ID N0:1, SEQ ID NO : 3 , SEQ ID NO: 6, SEQ ID 
NO: 10, the nucleotide sequence of the cDNA ATCC 988 07, or 
a complement thereof. 
io The invention also features a nucleic acid 

molecule which includes a nucleotide sequence encoding a 
protein having an amino acid sequence that is at least 
45% (55%, 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO : 2 , SEQ ID NO : 5 , SEQ ID 
15 NO: 7, SEQ ID NO : 9 , SEQ ID NO: 11, SEQ ID NO: 13, or the 
amino acid sequence encoded by the cDNA of ATCC 98 807. 

In a preferred embodiment, a Tango- 77 nucleic acid 
molecule has the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO: 6, SEQ ID NO: 10 or the 
20 nucleotide sequence of the cDNA of ATCC 98807. 

Also within the invention is a nucleic acid 
molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO: 5, SEQ ID NO : 7 , SEQ ID NO: 8, SEQ ID NO : 9 , SEQ ID 
25 NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, wherein the fragment 
includes at least 15 (e.g., 25, 30, 50, 100, 150, or 178) 
contiguous amino acids of SEQ ID NO : 2 , SEQ ID NO: 4, SEQ 
ID NO: 5, SEQ ID NO : 7 , SEQ ID NO : 8 , SEQ ID NO : 9 , SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or the polypeptide 
30 encoded by the cDNA of ATCC Accession Number 98807. 

The invention includes a nucleic acid molecule 
which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 
35 SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO : 12 , SEQ ID NO: 13, or 
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an amino acid sequence encoded by the cDNA of ATCC 
Accession Number 98807, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID NO: 10, or a 
5 complement thereof under stringent conditions. 

Also within the invention are: an isolated 
Tango- 77 protein having an amino acid sequence that is at 
least about 45%, preferably 65%, 75%, 85%, 95%, or 98% 
identical to the amino acid sequence of SEQ ID NO: 5, SEQ 

io ID NO:9 or SEQ ID NO:13 (mature human Tango- 77 ) , or the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO : 7 or SEQ ID 
NO:ll (immature human Tango-77) . 

Also within the invention are: an isolated 
Tango-77 protein which is encoded by a nucleic acid 

is molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ 
ID NO: 3, SEQ ID NO : 6 , SEQ ID NO: 10 or the cDNA of ATCC 
98807; and an isolated Tango-77 protein which is encoded 
by a nucleic acid molecule having a nucleotide sequence 

20 which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO:3, SEQ ID NO : 6 , SEQ ID NO: 10, the non-coding 
strand of the cDNA of ATCC 98807, or the complement 
thereof . 

25 Also within the invention is a polypeptide which 

is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO : 2 , SEQ 
ID NO: 4, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an 

3 0 amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO : 1 , SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID 
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NO: 10 or the complement thereof under stringent 
conditions . 

Another embodiment of the invention features 
Tango- 77 nucleic acid molecules which specifically detect 
5 Tango-77 nucleic acid molecules relative to nucleic acid 
molecules encoding other members of the cytokine 
superfamily. For example, in one embodiment, a Tango-77 
nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 
10 nucleotide sequence of SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID 
NO: 6, SEQ ID NO: 10, the cDNA of ATCC 988 07, or a 
complement thereof. In another embodiment, the Tango-77 
nucleic acid molecule is at least 300 (325, 350, 375, 
400, 425, 450, 500, 550, 600, 650, 700, 800, 900, or 989) 
15 nucleotides in length and hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or a 
complement thereof . In yet another embodiment , the 
20 invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a Tango-77 
nucleic acid. 

Another aspect of the invention provides a vector, 
e.g., a recombinant expression vector, comprising a 
25 Tango-77 nucleic acid molecule of the invention. In 
another embodiment, the invention provides a host cell 
containing such a vector. The invention also provides a 
method for producing Tango-77 protein by culturing, in a 
suitable medium, a host cell of the invention containing 
30 a recombinant expression vector such that a Tango-77 
protein is produced. 

Another aspect of this invention features isolated 
or recombinant Tango-77 proteins and polypeptides. 
Preferred Tango-77 proteins and polypeptides possess at 
35 least one biological activity possessed by naturally 
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occurring human Tango-77, e.g., (i) the ability to 
interact with proteins in the Tango-77 signalling pathway 
(ii) the ability to interact with a Tango-77 ligand or 
receptor; or (iii) the ability to interact with an 
5 intracellular target protein, (iv) the ability to 

interact with a protein involved in inflammation and (v) 
the ability to bind the IL-1 receptor. Other activities 
include the induction and suppression of polypeptide 
interleukins , cytokines and growth factors. 

io The Tango-77 proteins of the present invention, or 

biologically active portions thereof, can be operably 
linked to a non-Tango-77 polypeptide (e.g., heterologous 
amino acid sequences) to form Tango-77 fusion proteins. 
The invention further features antibodies that 

is specifically bind Tango-77 proteins, such as monoclonal 
or polyclonal antibodies. In addition, the Tango-77 
proteins or biologically active portions thereof can be 
incorporated into pharmaceutical compositions, which 
optionally include pharmaceutically acceptable carriers. 

20 In another aspect, the present invention provides 

a method for detecting the presence of Tango-77 activity 
or expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an 
indicator of Tango-77 activity or expression such that 

25 the presence of Tango-77 activity or expression is 
detected in the biological sample. 

In another aspect, the invention provides a method 
for modulating Tango-77 activity comprising contacting a 
cell with an agent that modulates (inhibits or 

30 stimulates) 

Tango-77 activity or expression such that Tango-77 
activity or expression in the cell is modulated. In one 
embodiment, the agent is an antibody that specifically 
binds to Tango-77 protein. In another embodiment, the 
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agent modulates expression of Tango- 77 by modulating 
transcription of a Tango-77 gene, splicing of a Tango-77 
mRNA, or translation of a Tango-77 mRNA. In yet another 
embodiment, the agent is a nucleic acid molecule having a 
5 nucleotide sequence that is antisense to the coding 
strand of the Tango-77 mRNA or the Tango-77 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 
characterized by aberrant Tango-77 protein activity or 

10 nucleic acid expression by administering an agent which 
is a Tango-77 modulator to the subject. In one 
embodiment, the Tango-77 modulator is a Tango-77 protein. 
In another embodiment, the Tango-77 modulator is a 
Tango-77 nucleic acid molecule. In other embodiments, 

is the Tango-77 modulator is a peptide, peptidomimetic , or 
other small molecule. In a preferred embodiment, the 
disorder characterized by aberrant Tango-77 protein or 
nucleic acid expression can include chronic and acute 
inf 1 ammat ion . 

20 The present invention also provides a diagnostic 

assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a Tango-77 protein; (ii) mis-regulation of a 

25 gene encoding a Tango-77 protein; and (iii) aberrant 
post-translational modification of a Tango-77 protein, 
wherein a wild-type form of the gene encodes a protein 
with a Tango-77 activity. 

In another aspect, the invention provides a 

30 method for identifying a compound that binds to or 
modulates the activity of a Tango-77 protein. In 
general, such methods entail measuring a biological 
activity of a Tango-77 protein in the presence and 
absence of a test compound and identifying those 
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compounds which alter the activity of the Tango- 77 
protein. 

The invention also features methods for 
identifying a compound which modulates the expression of 
5 Tango-77 by measuring the expression of Tango- 77 in the 
presence and absence of a compound . 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims . 

io Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 
of Tango-77. The Tango-77 cDNA has three possible open 
reading frames which encode the amino acid sequence (SEQ 
ID NO: 2, SEQ ID NO : 7 and SEQ ID NO: 11) of human Tango-77. 

is The three potential open reading frames of SEQ ID NO:l 

extend from: (1) nucleotide 356 to nucleotide 889 (SEQ ID 
NO:3); (2) nucleotide 389 to nucleotide 889 (SEQ ID 
NO:6); and (3) nucleotide 481 to nucleotide 889 (SEQ ID 
NO: 10) . 

20 Figure 2 depicts an alignment of an amino acid 

sequence of Tango-77 (T77; SEQ ID NO: 2) with IL-1RA (SEQ ' 
ID NO: 14) , and IL-l/S (SEQ ID NO: 15) . 

Figure 3 depicts the genomic sequence of BAC1 (SEQ 
ID NO: 16) - 

25 Figure 4 depicts the genomic sequence of BAC2 (SEQ 

ID NO: 17) . 

Figure 5 depicts an amino acid sequence of an 
alternatively spliced form of Tango-77 (SEQ ID NO: 2) as 
predicted by Procrustes (T77-procrustes ; SEQ ID NO:18). 
30 Figure 6 depicts an alignment of an amino acid 

sequence of an alternatively spliced form of Tango-77 
(T77-procrustes; SEQ ID NO: 18) with Tango-77 (SEQ ID 
NO: 2) . 
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Figure 7 depicts an alignment of an amino acid 
sequence of an alternatively spliced form of Tango-77 
(T77-procrustes; SEQ ID NO: 18) with IL-lra (SEQ ID 
NO: 14), and IL-1/3 (SEQ ID NO:15). 

5 Detailed Description of the Invention 

The present invention is based on the discovery of 
a cDNA molecule encoding human Tango-77, a member of the 
cytokine superfamily. The cDNA molecule encoding human 
Tango-77 has three possible open reading frames. The 
10 three possible nucleotide open reading frames for human 
Tango-77 protein are shown in Figure 1 (SEQ ID NO: 3, SEQ 
ID NO: 6 and SEQ ID NO: 10) . The predicted amino acid 
sequence for the three possible Tango-77 immature 
proteins are also shown in 
15 Figure 1 (SEQ ID NO: 2, SEQ ID NO: 7 or SEQ ID NO: 11) and 
three possible mature proteins are also shown in Figure 1 
(SEQ ID NO: 5, SEQ ID NO : 9 and SEQ ID NO: 13) . 

The Tango-77 cDNA of Figure 1 (SEQ ID NO : 1 ) , which 
is approximately 989 nucleotides long including 
20 untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 19 kDa, 18 kDa, or 
14.9 KDa (excluding post-translational modifications) and 
the possible mature form of the protein has a molecular 
weight of 13 kDa. A plasmid containing a cDNA encoding 
25 human Tango-77 (with the cDNA insert name of Of fthx077) 
was deposited with American Type Culture Collection 
(ATCC) , 10801 University Boulevard, Manassas, Virginia 
20110-2209 on July 2, 1998 and assigned Accession Number 
98807. This deposit will be maintained under the terms 
30 of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made merely as a 
convenience for those of skill in the art and is not an 
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admission that a deposit is required under 3 5 U.S.C. 
§112 . 

Human Tango-77 is one member of a family of 
molecules (the "Tango-77 family") having certain 
5 conserved structural and functional features. The term 
"family, " when referring to the protein and nucleic acid 
molecules of the invention, is intended to mean two or 
more proteins or nucleic acid molecules having a common 
structural domain and having sufficient amino acid or 
10 nucleotide sequence identity as defined herein. Such 

family members can be naturally occurring and can be from 
either the same or different species. For example, a 
family can contain a first protein of human origin and a 
homologue of that protein of murine origin, as well as a 
is second, distinct protein of human origin and a murine 

homologue of that protein. Members of a family may also 
have common functional characteristics. 

As used interchangeably herein a "Tango-77 
activity", "biological activity of Tango-77" or 
20 "functional activity of Tango-77", refers to an activity 
exerted by a Tango-77 protein, polypeptide or nucleic 
acid molecule on a Tango-77 responsive cell as determined 
in vivo, or in vitro, according to standard techniques. 
A Tango-77 activity can be a direct activity, such as an 
25 association with a second protein, or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the Tango-77 protein with a second 
protein. In a preferred embodiment, a Tango-77 activity 
includes at least one or more of the following 
30 activities: (i) the ability to interact with proteins in 
the Tango-77 signalling pathway (ii) the ability to 
interact with a Tango-77 ligand or receptor; or (iii) the 
ability to interact with an intracellular target protein, 
(iv) the ability to interact with a protein involved in 
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inflammation, and (v) the ability to bind the IL.-1 
receptor. 

Accordingly, another embodiment of the invention 
features isolated Tango- 77 proteins and polypeptides 
5 having a Tango-77 activity. 

Yet another embodiment of the invention features 
Tango-77 molecules which contain a signal sequence. 
Generally, a signal sequence (or signal peptide) is a 
peptide containing about 21 to 63 amino acids which 
10 occurs at the extreme N-terminal end of a secretory 
protein. The native Tango-77 signal sequence (SEQ ID 
NO: 4, SEQ ID NO : 8 , or SEQ ID NO: 12) can be removed and 
replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), 
is expression and/or secretion of Tango-77 can be increased 
through use of a heterologous signal sequence. For 
example, the gp67 secretory sequence of the baculovirus 
envelope protein can be used as a heterologous signal 
sequence. Alternatively, the native Tango-77 signal 
20 sequence can itself be used as a heterologous signal 

sequence in expression systems, e.g., to facilitate the 
secretion of a protein of interest. 

Various aspects of the invention are described in 
further detail in the following subsections. 

25 I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode Tango-77 proteins or 
biologically active portions thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes 

30 to identify Tango- 77 -encoding nucleic acids (e.g., 

Tango-77 mRNA) and fragments for use as PCR primers for 
the amplification or mutation of Tango-77 nucleic acid 
molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
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cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 
stranded or double -stranded, but preferably is double- 
5 stranded DNA. 

An 11 isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 
10 sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 
15 isolated Tango-77 nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb 
of nucleotide sequences which naturally flank the nucleic 
acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" nucleic 
20 acid molecule, such as a cDNA molecule, can be 

substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. 
25 A nucleic acid molecule of the present invention, 

e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO : 6 , SEQ ID 
NO: 10, the cDNA of ATCC 98807, or a complement of any of 
these nucleotide sequences, can be isolated using 
30 standard molecular biology techniques and the sequence 
information provided herein. Using all or a portion of 
the nucleic acid sequences of SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or the 
complement thereof as a hybridization probe, Tango-77 
35 nucleic acid molecules can be isolated using standard 
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hybridization and cloning techniques (e.g., as described 
in Sambrook et al . , eds . , Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 
PGR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to Tango-77 nucleotide 
sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 

In another preferred embodiment, an isolated 
nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO:6, SEQ ID NO: 10 the cDNA of ATCC 98807, or a 
o portion thereof. A nucleic acid molecule which is 

complementary to a given nucleotide sequence is one which 
is sufficiently complementary to the given nucleotide 
sequence that it can hybridize to the given nucleotide 
sequence thereby forming a stable duplex. 
5 Moreover, the nucleic acid molecule of the 

invention can comprise only a portion of a nucleic acid 
sequence encoding Tango-77, for example, a fragment which 
can be used as a probe or primer or a fragment encoding a 
biologically active portion of Tango-77. The nucleotide 
o sequence determined from the cloning of the human 

Tango-77 gene allows for the generation of probes and 
primers designed for use in identifying and/or cloning 
Tango-77 homologues in other cell types, e.g., from other 
tissues, as well as Tango-77 homologues from other 
5 mammals. The probe/primer typically comprises 
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substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 
conditions to at least about 12, preferably about 25, 
5 more preferably about 50, 75, 100, 125, 150, 175, 200, 

250, 300, 350 or 400 consecutive nucleotides of the sense 
or anti-sense sequence of SEQ ID NO:l, SEQ ID NO : 3 , SEQ 
ID NO: 6, SEQ ID NO: 10, or the cDNA of ATCC 98807. 
Alternatively, the oligonucleotide can typically comprise 
10 a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 
about 25, more preferably about 50, 75, 100, 125, 150, 
175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
the sense or anti-sense sequence of a naturally occurring 
15 mutant of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO : 6 , SEQ ID 
NO: 10, or the cDNA of ATCC 98807. 

Probes based on the human Tango- 77 nucleotide 
sequence can be used to detect transcripts or genomic 
sequences encoding the same or identical proteins. The 
20 probe comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissues 
which mis-express a Tango- 77 protein, such as by 
25 measuring a level of a Tango- 77 -encoding nucleic acid in 
a sample of cells from a subject, e.g., detecting 
Tango- 77 mRNA levels or determining whether a genomic 
Tango-77 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically 
30 active portion of Tango-77" can be prepared by isolating 
a portion of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO : 6 , SEQ 
ID NO: 10 or the nucleotide sequence of the cDNA of ATCC 
988 07 which encodes a polypeptide having a Tango-77 
biological activity, expressing the encoded portion of 
35 Tango-77 protein (e.g., by recombinant expression in 
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vitro) and assessing the activity of the encoded portion 
of Tango-77. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 

5 ID NO:l, SEQ ID NO:3, SEQ ID NO : 6 , SEQ ID NO:10, or the 
cDNA of ATCC 98807 due to degeneracy of the genetic code 
and thus encode the same Tango- 77 protein as that encoded 
by the nucleotide sequence shown in SEQ ID NO:!, SEQ ID 
NO: 3, SEQ ID NO : 6 , SEQ ID NO: 10, or the cDNA of ATCC 

10 98807. 

In addition to the human Tango-77 nucleotide 
sequence shown in SEQ ID NO : 1 , SEQ ID NO : 3 , SEQ ID NO : 6 , 
SEQ ID NO: 10, or the cDNA of ATCC 98807, it will be 
appreciated by those skilled in the art that DNA sequence 
is polymorphisms that lead to changes in the amino acid 
sequences of Tango-77 may exist within a population 
(e.g., the human population). Such genetic polymorphism 
in the Tango-77 gene may exist among individuals within a 
population due to natural allelic variation. An allele 
20 is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the term "allelic 
variant" refers to a nucleotide sequence which occurs at 
a Tango-77 locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 
25 and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a Tango-77 
protein, preferably a mammalian Tango-77 protein. Such 
natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of the Tango-77 gene. 
30 Alternative alleles can be identified by sequencing the 
gene of interest in a number of different individuals. 
This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of 
individuals. Any and all such nucleotide variations and 
35 resulting amino acid polymorphisms or variations in 
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Tango-77 that are the result of natural allelic variation 
and that do not alter the functional activity of Tango-77 
are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding Tango-77 
5 proteins from other species (Tango-77 homologues) , which 
have a nucleotide sequence which differs from that of a 
human Tango^77, are intended to be within the scope of 
the invention. Nucleic acid molecules corresponding to 
natural allelic variants and homologues of the Tango-77 
10 cDNA of the invention can be isolated based on their 
identity to the human Tango-77 nucleic acids disclosed 
herein using the human cDNAs, or a portion thereof, as a 
hybridization probe according to standard hybridization 
techniques under stringent hybridization conditions, 
is Accordingly, in another embodiment, an isolated 

nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, or 989) nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule 
20 comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ 
ID NO: 10, or the cDNA of ATCC 98807. 

As used herein, the term "hybridizes under 
stringent conditions" is intended to describe conditions 
25 for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
30 in Molecular Biology, John Wiley & Sons, N.Y. (1989) , 
6.3.1-6.3.6. A preferred, non-limiting example of 
stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 
35 50-65°C. Preferably, an isolated nucleic acid molecule 
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of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 6, SEQ ID NO: 10, the cDNA of ATCC 98807, or the 
complement thereof, corresponds to a naturally-occurring 
5 nucleic acid molecule. As used herein, a "naturally- 
occurring" nucleic acid molecule refers to an RNA or DNA 
molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic 
10 variants of the Tango- 77 sequence that may exist in the 
population, the skilled artisan will further appreciate 
that changes can be introduced by mutation into the 
nucleotide sequence of SEQ ID NO : 1 , SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10 or the cDNA of ATCC 98 807, thereby 
15 leading to changes in the amino acid sequence of the 

encoded Tango-77 protein, without altering the biological 
activity of the Tango-77 protein. Amino acid residues 
that are not conserved or only semiconserved among Tango- 
77 of various species may be non-essential for activity 
20 and thus would likely be targets for alteration. 

Alternatively, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" 
amino acid residues. A "non-essential" amino acid 
residue is a residue that can be altered from the wild- 
25 type sequence of Tango-77 (e.g., the sequence of SEQ ID 
NO: 2, SEQ ID NO : 5 , SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11 
or SEQ ID NO: 13) without altering the biological 
activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino 
30 acid residues that are conserved among the Tango-77 

proteins of various species may be essential for activity 
and thus would not likely be targets for alteration, 
unless one wishes to reduce or alter Tango-77 activity. 
Accordingly, another aspect of the invention 
35 pertains to nucleic acid molecules encoding Tango-77 
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proteins that contain changes in amino acid residues that 
are not essential for activity. Such Tango- 77 proteins 
differ in amino acid sequence from SEQ ID NO:2, SEQ ID 
NO: 5, SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID NO ill, or SEQ ID 
5 NO: 13 yet retain biological activity. In one embodiment, 
the isolated nucleic acid molecule includes a nucleotide 
sequence encoding a protein that includes an amino acid 
sequence that is at least about 45% identical, 65%, 75%, 
85% , 95% , or 98% identical to the amino acid sequence of 

10 SEQ ID NO:2, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ 
ID NO: 11, or SEQ ID NO: 13. 

An isolated nucleic acid molecule encoding a 
Tango-77 protein having a sequence which differs from 
that of SEQ ID NO: 2, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID 

15 NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 can be created by 
introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID NO: 10, or 
the cDNA of ATCC 98807 such that one or more amino acid 

20 substitutions, additions or deletions are introduced into 
the encoded protein. Mutations can be introduced by 
standard techniques, such as site -directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more 

25 predicted non-essential amino acid residues. A 

"conservative amino acid substitution" is one in which 
the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino 
acid residues having similar side chains have been 

30 defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, 
histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, 

35 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
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valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine) . 
5 Thus, a predicted nonessential amino acid residue in 
Tango- 7 7 is preferably replaced with another amino acid 
residue from the same side chain family. Alternatively, 
mutations can be introduced randomly along all or part of 
a Tango- 77 coding sequence, such as by saturation 
10 mutagenesis, and the resultant mutants can be screened 

for Tango-77 biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded 
protein can be expressed recombinantly and the activity 
of the protein can be determined. 
15 in a preferred embodiment, a mutant Tango-77 

protein can be assayed for: (1) the ability to form 
protein: protein interactions with proteins in the 
Tango-77 signalling pathway; (2) the ability to bind a 
Tango-77 ligand or receptor; or (3) the ability to bind 
20 to an intracellular target protein or (4) the ability to 
interact with a protein involved in inflammation or (5) 
the ability to bind the IL-1 receptor. In yet another 
preferred embodiment, a mutant Tango-77 can be assayed 
for the ability to modulate inflammation, asthma, 
25 autoimmune dieseases, and sepsis. 

The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 
complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a double- 
30 stranded cDNA molecule or complementary to an mRNA 

sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
nucleic acid can be complementary to an entire Tango-77 
coding strand, or to only a portion thereof, e.g., all or 
35 part of the protein coding region (or open reading 
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frame) . An antisense nucleic acid molecule can be 
ant i sense to a noncoding region of the coding strand of a 
nucleotide sequence encoding Tango-77. The noncoding 
regions ("5' and 3' untranslated regions") are the 5' and 
5 3' sequences which flank the coding region and are not 
translated into amino acids. 

Given the coding strand sequences encoding 
Tango-77 disclosed herein (e.g., SEQ ID NO: 3, SEQ ID NO : 5 , 
or SEQ ID NO:8), antisense nucleic acids of the invention 
10 can be designed according to the rules of Watson and 

Crick base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of 
Tango-77 mRNA, but more preferably is an oligonucleotide 
which is antisense to only a portion of the coding or 
15 noncoding region of Tango-77 mRNA. For example, the 
antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of Tango-77 
mRNA, e.g., an oligonucleotide having the sequence 
5 ' - TGCAACTTTTACAGGAAACAC - 3 ' ( SEQ ID NO: 19) or 
20 5 ' -CCTCACTTTTACCCGAGACTC-3 ' (SEQ ID NO: 20) or 

5 ' -GACGGGTGGTACTTAAAACAA-3 ' (SEQ ID NO: 21). An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 
25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be 
25 constructed using chemical synthesis and enzymatic 

ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
30 modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
35 Examples of modified nucleotides which can be used to 
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generate the antisense nucleic acid include 5- 
f luorour ac i 1 , 5 - bromour ac i 1 , 5 - chl orourac i 1 , 
5 - iodourac i 1 , hypoxanthine , xanthine , 4 - acetyl cyt os ine , 
5- (carboxyhydroxylmethyl) uracil, 5- 
5 carboxymethylaminomethyl -2 - thiouridine , 

5-carboxymethylaminomethyluracil , dihydrouracil , 
beta-D-galactosylqueosine , inosine , N6 - 
isopentenyladenine, 1-methylguanine, 1 -methyl inosine , 
2 , 2-dimethylguanine # 2-methyladenine, 2 -methylguanine , 3- 
10 methyl cytosine, 5 -methylcytosine , N6-adenine, 7- 
methylguanine , 5 -methylaminomethyluracil , 5 - 
methoxyaminomethyl -2 -thiouracil , beta-D-mannosylqueosine, 
5 ' -methoxycarboxymethyluracil , 5-methoxyuracil , 2- 
methyl thio-N6 - isopentenyladenine , uracil - 5 -oxyacet ic acid 
is (v) , wybutoxosine, pseudouracil , queosine, 2- 

thiocytosine , 5 -methyl -2 -thiouracil , 2 - thiouracil , 4 - 
thiouracil, 5 -methyluracil , uracil- 5 -oxyacetic acid 
methylester, uracil -5 -oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil 
20 (acp3)w, and 2 , 6-diaminopurine . Alternatively, the 

antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
25 antisense orientation to a target nucleic acid of 

interest, described further in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
30 to cellular mRNA and/or genomic DNA encoding a Tango-77 
protein to thereby inhibit expression of the protein, 
e.g., by inhibiting transcription and/or translation. 
The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, 
35 in the case of an antisense nucleic acid molecule which 
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binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a 
route of administration of antisense nucleic acid 
molecules of the invention includes direct injection at a 
5 tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 

io antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 

is To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

20 An antisense nucleic acid molecule of the 

invention can be an a-anomeric nucleic acid molecule. An 
of-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual jS-units, the strands run parallel 

25 to each other (Gaultier et al . (1987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 215:327- 

30 330) . 

The invention also encompasses ribozymes. 
Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a 
35 complementary region. Thus, ribozymes (e.g., hammerhead 
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ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave 
Tango- 7 7 mRNA transcripts to thereby inhibit translation 
of Tango-77 mRNA. A ribozyme having specificity for a 
5 Tango- 77 -encoding nucleic acid can be designed based upon 
the nucleotide sequence of a Tango-77 cDNA disclosed 
herein (e.g., SEQ ID NO:!, SEQ ID NO:3, SEQ ID NO:6, SEQ 
ID NO: 10) . For example, a derivative of a Tetrahymena L- 
19 IVS RNA can be constructed in which the nucleotide 
10 sequence of the active site is complementary to the 

nucleotide sequence to be cleaved in a Tango-7 7 -encoding 
mRNA. See, e.g., Cech et al . U.S. Patent No. 4,987,071; 
and Cech et al . U.S. Patent No. 5,116,742. 
Alternatively, Tango-77 mRNA can be used to select a 
is catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel and 
Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
20 example, Tango-77 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the 
regulatory region of the Tango-77 (e.g., the Tango-77 
promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the Tango-77 
25 gene in target cells. See generally, Helene (1991) 

Anticancer Drug Des. 6 (6) : 569-84 ; Helene (1992) Ann. N . Y. 
Acad. Sci. 660:27-36; and Maher (1992) Bioassays 
14 (12) :807-15. 

In preferred embodiments, the nucleic acid 
30 molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate 
35 peptide nucleic acids (see Hyrup et al . (1996) Bioorganic 
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& Medicinal Chemistry 4(1) : 5-23) . As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
5 backbone and only the four natural nucleobases are 

retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 

10 peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry-O' Keef e et al . (1996) Proc. Natl. 
Acad. Sci. USA 93: 14670-675. 

PNAs of Tango-77 can be used in therapeutic and 
diagnostic applications. For example, PNAs can be used 

is as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing 
transcription or translation arrest or inhibiting 
replication. PNAs of Tango-77 can also be used, e.g., in 
the analysis of single base pair mutations in a gene by, 

20 e.g., PNA directed PCR clamping; as artificial 

restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as 
probes or primers for DNA sequence and hybridization 
(Hyrup (1996) supra; Perry-O' Keef e et al . (1996) Proc . 

25 Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of Tango-77 can be 
modified, e.g., to enhance their stability or cellular 
uptake, by attaching lipophilic or other helper groups to 
PNA, by the formation of PNA -DNA chimeras, or by the use 

30 of liposomes or other techniques of drug delivery known 
in the art. For example, PNA -DNA chimeras of Tango-77 
can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, 
35 to interact with the DNA portion while the PNA portion 
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would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, 
number of bonds between the nucleobases, and orientation 
5 (Hyrup (19 96) supra) . The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) supra and 
Finn et al . (1996) Nucleic Acids Res. 24 (17) : 3357-63 . 
For example, a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling chemistry 
10 and modified nucleoside analogs. Compounds such as 5'- 

(4 -methoxytrityl ) amino-5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 
DNA (Mag et al . (1989) Nucleic Acid Res. 17:5973-88). 
PNA monomers are then coupled in a stepwise manner to 
is produce a chimeric molecule with a. 5' PNA segment and a 
3' DNA segment (Finn et al . (1996) Nucleic Acids Res. 
24 (17) :3357-63) . Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al . (1975) Bioorganic Med. Chem. Lett. 
20 5 : 1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 
25 e.g., Letsinger et al . (198 9) Proc . Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 
30 modified with hybridization- triggered cleavage agents 

(see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
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hybridization triggered cross-linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

II. Isolated Tango-77 Proteins and Ant i -Tango-77 
Antibodies 

5 One aspect of the invention pertains to isolated 

Tango-77 proteins, and biologically active portions 
thereof, as well as polypeptide fragments suitable for 
use as immunogens to raise anti-Tango-77 antibodies. In 
one embodiment, native Tango-77 proteins can be isolated 
10 from cells or tissue sources by an appropriate 

purification scheme using standard protein purification 
techniques. In another embodiment, Tango-77 proteins are 
produced by recombinant DNA techniques. Alternative to 
recombinant expression, a Tango-77 protein or polypeptide 
15 can be synthesized chemically using standard peptide 
synthesis techniques . 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
20 the cell or tissue source from which the Tango-77 protein 
is derived, or substantially free of chemical precursors 
or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" 
includes preparations of Tango-77 protein in which the 
25 protein is separated from cellular components of the 
cells from which it is isolated or recombinantly 
produced. Thus, Tango-77 protein that is substantially 
free of cellular material includes preparations of 
Tango-77 protein having less than about 30%, 20%, 10%, or 
30 5% (by dry weight) of non-Tango-77 protein (also referred 
to herein as a "contaminating protein") . When the 
Tango-77 protein or biologically active portion thereof 
is recombinantly produced, it is also preferably 
substantially free of culture medium, i.e., culture 



9Q0642SA1 I » 



WO 99/06426 



PCT/US98/16102 



- 28 - 

medium represents less than about 2 0%, 10%, or 5% of the 
volume of the protein preparation. When Tango-77 protein 
is produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 
5 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the protein. Accordingly such preparations of Tango-7 7 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or non-Tango- 77 chemicals. 

io Biologically active portions of a Tango-77 protein 

include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the Tango-77 protein (e.g., the amino acid 
sequence shown in SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO : 7 , 

15 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13), which 

include fewer amino acids than the full length Tango-77 
proteins, and exhibit at least one activity of a Tango-77 
protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 

20 the Tango-77 protein. A biologically active portion of a 
Tango-77 protein can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 

25 prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native Tango-77 
protein. 

Preferred Tango-77 protein has the amino acid 
sequence shown of SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO: 7, 

30 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13. Other useful 
Tango-77 proteins are substantially identical to SEQ ID 
NO: 2, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
NO: 11, or SEQ ID NO: 13 and retain the functional activity 
of the protein of SEQ ID NO:2, SEQ ID NO: 5, SEQ ID NO : 7 , 

35 SEQ ID NO: 9, SEQ ID NO: 11, or SEQ ID NO: 13 yet differ in 
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amino acid sequence due to natural allelic variation or 
mutagenesis. Accordingly, a useful Tango-77 protein is a 
protein which includes an amino acid sequence at least 
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99% 
5 identical to the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO: 5, SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID NO: 11, or SEQ 
ID NO: 13 and retains the functional activity of the 
Tango-77 proteins of SEQ ID NO: 2, SEQ ID NO : 5 , SEQ ID 
NO: 7, SEQ ID NO : 9 , SEQ ID NO: 11, or SEQ ID NO: 13. In a 
io preferred embodiment, the Tango-77 protein retains a 
functional activity of the Tango-77 protein of SEQ ID 
NO:2, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
NO: 11, or SEQ ID NO: 13. 

To determine the percent identity of two amino 
is acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
20 or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
25 that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of positions, e.g., 
overlapping x 100) . Preferably, the two sequences are 
30 the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two 
35 sequences is the algorithm of Karlin and Altschul (1990) 
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Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) 
5 J". Mol. Biol. 215:403-410. BLAST nucleotide searches can 
be performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to Tango- 77 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST 

10 program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to Tango- 77 protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 

is When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non- 
limiting example of a mathematical algorithm utilized for 
20 the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
the ALIGN program for comparing amino acid sequences, a 

25 PAM12 0 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 

30 percent identity, only exact matches are counted. 

The invention also provides Tango- 77 chimeric or 
fusion proteins. As used herein, a Tango-77 "chimeric 
protein" or 11 fusion protein 11 comprises a Tango-77 
polypeptide operably linked to a non-Tango- 77 

35 polypeptide. A "Tango-77 polypeptide" refers to a 
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polypeptide having an amino acid sequence corresponding 
to Tango-77 polypeptides, whereas a "non -Tango- 77 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not 
5 substantially identical to the Tango-77 protein, e.g., a 
protein which is different from the Tango-77 protein and 
which is derived from the same or a different organism. 
Within a Tango-77 fusion protein the Tango-77 polypeptide 
can correspond to all or a portion of a Tango-77 protein, 

10 preferably at least one biologically active portion of a 
Tango-77 protein. Within the fusion protein, the term 
"operably linked" is intended to indicate that the 
Tango-77 polypeptide and the non-Tango-77 polypeptide are 
fused in- frame to each other. The non-Tango-77 

is polypeptide can be fused to the N- terminus or C- terminus 
of the Tango-77 polypeptide. 

One useful fusion protein is a GST-Tango-77 fusion 
protein in which the Tango-77 sequences are fused to the 
C-terminus of the GST sequences. Such fusion proteins 

20 can facilitate the purification of recombinant Tango-77. 

In another embodiment, the fusion protein is a 
Tango-77 protein containing a heterologous signal 
sequence at its N-terminus. For example, the native 
Tango-77 signal sequence (i.e., about amino acids 1 to 63 

25 of SEQ ID NO:2; SEQ ID NO : 4 ; or about amino acids 1 to 52 
of SEQ ID N0:7;SEQ ID NO : 8 ; or about amino acids 1 to 21 
of SEQ ID NO: 11; SEQ ID NO; 12) can be removed arid replaced 
with a signal sequence from another protein. In certain 
host cells (e.g., mammalian host cells), expression 

30 and/or secretion of Tango-77 can be increased through use 
of a heterologous signal sequence. For example, the gp6 7 
secretory sequence of the baculovirus envelope protein 
can be used as a heterologous signal sequence (Ausubel et 
al . , supra) . Other examples of eukaryotic heterologous 
35 signal sequences include the secretory sequences of 
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melittin and human placental alkaline phosphatase 
(Stratagene; La Jolla, California) . In yet another 
example, useful prokaryotic heterologous signal sequences 
include the phoA secretory signal {Sambrook et al . , 
5 supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 

In yet another embodiment, the fusion protein is 
an Tango- 77 -immunoglobulin fusion protein in which all or 
part of Tango- 77 is fused to sequences derived from a 
io member of the immunoglobulin protein family. The 

Tango- 77 -immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and 
administered to a subject to inhibit an interaction 
between a Tango-77 ligand and a Tango-77 receptor on the 
is surface of a cell, to thereby suppress Tango-77-mediated 
signal transduction in vivo. The Tango- 77 -immunoglobulin 
fusion proteins can be used to affect the bioavailability 
of a Tango-77 cognate ligand. Inhibition of the Tango-77 
ligand/Tango-77 interaction may be useful therapeutically 
20 for both the treatment of inflammatory and autoimmune 

disorders. Moreover, the Tango-77- immunoglobulin fusion 
proteins of the invention can be used as immunogens to 
produce anti-Tango-77 antibodies in a subject, to purify 
Tango-77 ligands and in screening assays to identify 
25 molecules which inhibit the interaction of Tango-77 with 
a Tango-77 receptor. 

Preferably, a Tango-77 chimeric or fusion protein 
of the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the 
30 different polypeptide sequences are ligated together in- 
frame in accordance with conventional techniques, for 
example by employing blunt -ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as 
35 appropriate, alkaline phosphatase treatment to avoid 
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undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene 
5- fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
annealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Current Protocols in Molecular 
10 Biology, Ausubel et al . eds . , John Wiley & Sons: 1992). 
Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a 
GST polypeptide) . An Tango- 77 -encoding nucleic acid can 
be cloned into such an expression vector such that the 
15 fusion moiety is linked in-frame to the Tango-77 protein. 

The present invention also pertains to variants of 
the Tango-77 proteins (i.e., proteins having a sequence 
which differs from that of the Tango-77 amino acid 
sequence) . Such variants can function as either Tango-77 
20 agonists (mimetics) or as Tango-77 antagonists. Variants 
of the Tango-77 protein can be generated by mutagenesis, 
e.g., discrete point mutation or truncation of the 
Tango-77 protein. An agonist of the Tango-77 protein can 
retain substantially the same, or a subset, of the 
25 biological activities of the naturally occurring form of 
the Tango-7 7 protein. An antagonist of the Tango-77 
protein can inhibit one or more of the activities of the 
naturally occurring form of the Tango-77 protein by, for 
example, competitively binding to a downstream or 
30 upstream member of a cellular signaling cascade which 

includes the Tango-77 protein. Thus, specific biological 
effects can be elicited by treatment with a variant of 
limited function. Treatment of a subject with a variant 
having a subset of the biological activities of the 
35 naturally occurring form of the protein can have fewer 
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side effects in a subject relative to treatment with the 
naturally occurring form of the Tango-77 proteins. 

Variants of the Tango-77 protein which function as 
either Tango-77 agonists (mimetics) or as Tango-77 
5 antagonists can be identified by screening combinatorial 
libraries of mutants, e.g., truncation mutants, of the 
Tango-77 protein for Tango-77 protein agonist or 
antagonist activity. In one embodiment, a variegated 
library of Tango-77 variants is generated by 
10 combinatorial mutagenesis at the nucleic acid level and 
is encoded by a variegated gene library. A variegated 
library of Tango-77 variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 
is degenerate set of potential Tango-77 sequences is 

expressible as individual polypeptides, or alternatively, 
as a set of larger fusion proteins (e.g., for phage 
display) containing the set of Tango-77 sequences 
therein. There are a variety of methods which can be 
20 used to produce libraries of potential Tango-77 variants 
from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed 
in an automatic DNA synthesizer, and the synthetic gene 
then ligated into an appropriate expression vector. Use 
25 of a degenerate set of genes allows for the provision, in 
one mixture, of all of the sequences encoding the desired 
set of potential Tango-77 sequences. Methods for 
synthesizing degenerate oligonucleotides are known in the 
art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura 
30 et al . (1984) Annu. Rev. Biochem. 53:323; Itakura et al . 
(1984) Science 198:1056; Ike et al . (1983) Nucleic Acid 
Res. 11:477) . 

In addition, libraries of fragments of the 
Tango-77 protein coding sequence can be used to generate 
35 a variegated population of Tango-77 fragments for 
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screening and subsequent selection of variants of a 
Tango- 77 protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double 
stranded PCR fragment of a Tango-77 coding sequence with 
5 a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded 
DNA, renaturing the DNA to form double stranded DNA which 
can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed 
10 duplexes by treatment with SI nuclease, and ligating the 
resulting fragment library into an expression vector. By 
this method, an expression library can be derived which 
encodes N- terminal and internal fragments of various 
sizes of the Tango-77 protein, 
is Several techniques are known in the art for 

screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
20 gene libraries generated by the combinatorial mutagenesis 
of Tango-77 proteins. The most widely used techniques, 
which are amenable to high through-put analysis, for 
screening large gene libraries typically include cloning 
the gene library into replicable expression vectors, 
25 transforming appropriate cells with the resulting library 
of vectors, and expressing the combinatorial genes under 
conditions in which detection of a desired activity 
facilitates isolation of the vector encoding the gene 
whose product was detected. Recursive ensemble 
30 mutagenesis (REM) , a technique which enhances the 

frequency of functional mutants in the libraries, can be 
used in combination with the screening assays to identify 
Tango-77 variants (Arkin and Yourvan (1992) Proc . Natl. 
Acad. Sci. USA 89:7811-7815; Delgrave et al . (1993) 
35 Protein Engineering 6 (3) : 327-331) . 
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An isolated Tango- 77 protein, or a portion or 
fragment thereof, can be used as an immunogen to generate 
antibodies that bind Tango- 77 using standard techniques 
for polyclonal and monoclonal antibody preparation. The 
5 full-length Tango-77 protein can be used or, 

alternatively, the invention provides antigenic peptide 
fragments of Tango-77 for use as immunogens . The 
antigenic peptide of Tango-77 comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the 
10 amino acid sequence shown in SEQ ID NO : 2 , SEQ ID NO: 5, 
SEQ ID NO: 7, SEQ ID NO : 9 , SEQ ID NO : 1 1 or SEQ ID NO: 13 
and encompasses an epitope of Tango-77 such that an 
antibody raised against the peptide forms a specific 
immune complex with Tango-77. 
is A Tango-77 immunogen typically is used to prepare 

antibodies by immunizing a suitable subject (e.g., 
rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 
example, recombinantly expressed Tango-77 protein or a 
20 chemically synthesized Tango-77 polypeptide. The 

preparation can further include an adjuvant, such as 
Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic Tango-77 preparation induces 
25 a polyclonal anti -Tango- 77 antibody response. 

Accordingly, another aspect of the invention 
pertains to anti-Tango-77 antibodies. The term 
"antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 
30 immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as Tango-77. A molecule which specifically binds to 
Tango-77 is a molecule which binds Tango-77, but does not 
substantially bind other molecules in a sample, e.g., a 
35 biological sample, which naturally contains Tango-77. 
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Examples of immunologically active portions of 
immunoglobulin molecules include F (ab) and F(ab') 2 
fragments which can be generated by treating the antibody 
with an enzyme such as pepsin. The invention provides 
5 polyclonal and monoclonal antibodies that bind Tango-77. 
The term "monoclonal antibody" or "monoclonal antibody 
composition" , as used herein, refers to a population of 
antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a 
10 particular epitope of Tango-77. A monoclonal antibody 
composition thus typically displays a single binding 
affinity for a particular Tango-77 protein with which it 
immunoreacts . 

Polyclonal anti-Tango-77 antibodies can be 
is prepared as described above by immunizing a suitable 
subject with a Tango-77 immunogen. The anti-Tango-77 
antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme 
linked immunosorbent assay (EL.ISA) using immobilized 
20 Tango-77. If desired, the antibody molecules directed 
against Tango-77 can be isolated from the mammal (e.g., 
from the blood) and further purified by well-known 
techniques, such as protein A chromatography to obtain 
the IgG fraction. At an appropriate time after 
25 immunization, e.g. , when the anti-Tango-77 antibody 
titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal 
antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein 
30 (1975) Nature 256:495-497, the human B cell hybridoma 

technique (Kozbor et al . (1983) Immunol Today 4:72), the 
EBV-hybridoma technique (Cole et al . (1985), Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96) or trioma techniques. The technology for 
35 producing hybridomas is well known (see generally Current 
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Protocols in Immunology (1994) Coligan et al . (eds. ) John 
Wiley Sc Sons, Inc., New York, NY). Briefly, an immortal 
cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a 
5 Tango- 77 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are 
screened to identify a hybridoma producing a monoclonal 
antibody that binds Tango-77. 

Any of the many well known protocols used for 
10 fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-Tango-77 
monoclonal antibody (see, e.g., Current Protocols in 
Immunology, supra; Galfre et al . (1977) Nature 266:55052; 
R.H. Kenneth, in Monoclonal Antibodies : A New Dimension 
15 In Biological Analyses, Plenum Publishing Corp., New 
York, New York (1980); and Lerner (1981) Yale <J. Biol. 
Med. f 54:3 87-402. Moreover, the ordinarily skilled 
worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the 
20 immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For 
example, murine hybridomas can be made by fusing 
lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an immortalized 
25 mouse cell line, e.g., a myeloma cell line that is 
sensitive to culture medium containing hypoxanthine, 
aminopterin and thymidine ("HAT medium") . Any of a 
number of myeloma cell lines can be used as a fusion 
partner according to standard techniques, e.g., the P3- 
30 NSl/l-Ag4-l, P3-x63-Ag8 . 653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, 
HAT- sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol ("PEG"). Hybridoma 
cells resulting from the fusion are then selected using 
35 HAT medium, which kills unfused and unproductively fused 
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myeloma cells (unf used splenocytes die after several days 
because they are not transformed) . Hybridoma cells 
producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supernatants 
5 for antibodies that bind Tango-77, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody- 
secreting hybridomas, a monoclonal ant i -Tango-77 antibody 
can be identified and isolated by screening a recombinant 

10 combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with Tango-77 to thereby isolate 
immunoglobulin library members that bind Tango-77. Kits 
for generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia .Recombinant 

15 Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP™ Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 

20 U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/2 0 791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 

25 92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J 12:725-734. 
Additionally , recombinant ant i -Tango- 77 

30 antibodies, such as chimeric and humanized monoclonal 

antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be 

35 produced by recombinant DNA techniques known in the art, 
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for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,4 96; European Patent 
Application 173,494; PCT Publication No. WO 86/01533; 
5 U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al . (1988) Science 240:1041-1043; Liu 
et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; 
Liu et al. (1987) J". Immunol. 139:3521-3526; Sun et al . 
(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura 
10 et al . (1987) Cane. Res. 47:999-1005; Wood et al . (1985) 
Nature 314:446-449; and Shaw et al . (1988) J". Natl. 
Cancer Inst. 80:1553-1559); Morrison (1985) Science 
229:1202-1207; Oi et al . (1986) Bio/Techniques 4:214; 
U.S. Patent 5,225,53 9; Jones et al . (1986) Nature 
15 321:552-525; Verhoeyan et al . (1988) Science 239:1534; 
and Beidler et al . (1988) J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly 
desirable for therapeutic treatment of human patients. 
Such antibodies can be produced using transgenic mice 
20 which are incapable of expressing endogenous 

immunoglobulin heavy and light chains genes, but which 
can express human heave and light chain genes. The 
transgenic mice are immunized in the normal fashion with 
a selected antigen, e.g., all or a portion of Tango-77. 
25 Monoclonal antibodies directed against the antigen can be 
obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 
30 mutation. Thus, using such a technique, it is possible 
to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. .Rev. Immunol. 13:65.-93). For a detailed discussion 
35 of this technology for producing human antibodies and 
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human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
5 companies such as Abgenix, Inc. (Freemont, CA) , can be 
engaged to provide human antibodies directed against a 
selected antigen using technology similar to the 
described above. 

Completely human antibodies which recognize a 
10 selected epitope can be generated using a technique 

referred to as "guided selection. 11 In this approach a 
selected non-human monoclonal antibody, e.g., a murine 
antibody, is used to guide the selection of a completely 
human antibody recognizing the same epitope, 
is First, a non-human monoclonal antibody which binds 

a selected antigen (epitope), e.g., an antibody which 
inhibits Tango- 77 activity, is identified. The heave 
chain and the light chain of the non-human antibody are 
cloned and used to create phage display Fab fragments. 
20 For example, the heave chain gene can be cloned into a 
plasmid vector so that the heavy chain can be secreted 
from bacteria. The light chain gene can be cloned into a 
phage coat protein gene so that the light chain can be 
expressed on the surface of phage. A repertoire (random 
25 collection) of human light chains fused to phage is used 
to infect the bacteria which express the non- human heavy 
chain. The resulting progeny phage display hybrid 
antibodies (human light chain/non-human heavy chain) . 
The selected antigen is used in a panning screen to 
30 select phage which bind the selected antigen. Several 
rounds of selection may be required to identify such 
phage. Next, human light chain genes are isolated from 
the selected phage which bind the selected antigen. 
These selected human light chain genes are then used to 
35 guide the selection of human heavy chain genes as 



SS06426A1 I > 



WO 99/06426 



PCT/US98/16102 



- 42 - 

follows. The selected human light chain genes are 
inserted into vectors for expression by bacteria. 
Bacteria expressing the selected human light chains are 
infected with a repertoire of human heavy chains fused to 
5 phage. The resulting progeny phage display human 
antibodies (human light chain/human heavy chain) . 

Next, the selected antigen is used in a panning 
screen to select phage which bind the selected antigen. 
The phage selected in this step display completely human 

10 antibody which recognize the same epitope recognized by 
the original selected, non-human monoclonal antibody. 
The genes encoding both the heavy and light chains are 
readily isolated and be further manipulated for 
production of human antibody. This technology is 

15 described by Jespers et al . (1994, Bio/ technology 12:899- 
903) . 

An anti-Tango-77 antibody (e.g., monoclonal 
antibody) can be used to isolate Tango- 77 by standard 
techniques, such as affinity chromatography or 
20 immunoprecipitation. An anti-Tango-77 antibody can 
facilitate the purification of natural Tango-77 from 
cells and of recombinantly produced Tango-77 expressed in 
host cells. Moreover, an anti-Tango-7 7 antibody can be 
used to detect Tango-77 protein (e.g., in a cellular 

25 lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the Tango-77 
protein. Anti-Tango-77 antibodies can be used 
diagnostically to monitor protein levels in tissue as 
part of a clinical testing procedure, e.g., to, for 

30 example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance . Examples of 
detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 

35 bioluminescent materials , and radioactive materials . 
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Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, )3-galactosidase , or 
acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
5 avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone, fluorescein, fluorescein 
isothiocyanate , rhodamine , dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol ; examples of 
10 bioluminescent materials include luciferase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 I, 131 I, 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to 
is vectors, preferably expression vectors, containing a 
nucleic acid molecule encoding Tango- 77 (or a portion 
thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of 
20 vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome . Certain vectors are capable of autonomous 
25 replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 
vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
30 introduction into the host cell, and thereby are 

replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 
expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
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DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
5 adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 

The recombinant expression vectors of the 
invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
10 host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 
expression, which is operably linked to the nucleic acid 
sequence to be expressed. Within a recombinant 
is expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 
20 when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such 
regulatory sequences are described, for example, in 
25 Goeddel; Gene Expression Technology: Methods in 

Enzymology 185, Academic Press, San Diego, CA (1990) . 
Regulatory sequences include those which direct 
constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
30 the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 
of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 
35 of expression of protein desired, etc. The expression 
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vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., Tango- 77 proteins , mutant forms 
5 of Tango- 77, fusion proteins, etc.) . 

The recombinant expression vectors of the 
invention can be designed for expression of Tango-77 in 
prokaryotic or eukaryotic cells, e.g., bacterial cells 
such as E~ coli, insect cells (using baculovirus 

10 expression vectors), yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). Alternatively, the 
recombinant expression vector can be transcribed and 

is translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 
often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 

20 expression of either fusion or non- fusion proteins. 

Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 

25 protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 

30 of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 

35 Typical fusion expression vectors include pGEX (Pharmacia 



£90S426A1_L> 



WO 99/06426 



PCT7US98/16102 



- 46 - 

Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAXi (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transf erase (GST) , maltose E binding protein, or protein 
5 A, respectively, to the target recombinant protein. 

Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al . (1988) Gene 
69:301-315) and pET lid (Studier et al . , Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 
io San Diego, California (1990) 60-89) . Target gene 
expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion 
promoter. Target gene expression from the pET lid vector 
relies on transcription from a T7 gnlO-lac fusion 
is promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 
20 One strategy to maximize recombinant protein 

expression in E. coli is to express the protein in a host 
bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
25 Academic Press, San Diego, California (1990) 119-128) . 

Another strategy is to alter the nucleic acid sequence of 
the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
30 (1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the Tango-77 expression 
vector is a yeast expression vector. Examples of vectors 
35 for expression in yeast S. cerivisae include pYepSecl 
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(Baldari et al . (1987) EMBOJ. 6:229-234), pMFa (Kurjan 
and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz 
et al . (1987) Gene 54:113-123), pYES2 (Invitrogen 
Corporation, San Diego, CA) , and picZ (InVitrogen Corp, 
5 San Diego, CA) . 

Alternatively, Tango- 77 can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series 
10 (Smith et al . (1983) Mol . Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31- 
39) . 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 

15 mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J\ 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 

20 regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al . (supra). 

25 In another embodiment, the recombinant mammalian 

expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
express the nucleic acid) . Tissue-specific regulatory 

30 elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific; Pinkert et al . (1987) Genes 
Dev. 1:268-277), lymphoid-specif ic promoters (Calame and 
Eaton (1988) Adv. Immunol. 43:235-275), in particular 
35 promoters of T cell receptors (Winoto and Baltimore 
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(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron-specific promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc . 

5 Natl. Acad. Sci. USA 86:5473-5477), pancreas - specif ic 

promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally- 
10 regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the or- fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant 

15 expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 
operably linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 

20 molecule) of an RNA molecule which is antisense to 

Tango- 77 mRNA. Regulatory sequences operably linked to a 
nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 

25 instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 

30 which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
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Weintraub et al . (Reviews - Trends in Genetics, Vol. 1(1) 
1986) . 

Another aspect of the invention pertains to host 
cells into which a recombinant expression vector of the 
5 invention has been introduced. The terms "host cell 11 and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
10 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein. 

is A host cell can be any prokaryotic or eukaryotic 

cell. For example, Tango-77 protein can be expressed in 
bacterial cells such as E. coli , insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells) . Other suitable host cells are known 
20 to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer 
25 to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co- 
precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation . Suitable methods for 
30 transforming or transfecting host cells can be found in 
Sambrook, et al . (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 
known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
35 cells may integrate the foreign DNA into their genome. 
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In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
5 markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Nucleic acid 
encoding a selectable marker can be introduced into a 
host cell on the same vector as that encoding Tango- 7 7 or 
can be introduced on a separate vector. Cells stably 
10 transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention, such as a 
is prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) Tango-77 protein. 
Accordingly, the invention further provides methods for 
producing Tango-77 protein using the host cells of the 
invention. In one embodiment, the method comprises 
20 culturing the host cell of invention (into which a 

recombinant expression vector encoding Tango-77 has been 
introduced) in a suitable medium such that Tango-77 
protein is produced. In another embodiment, the method 
further comprises isolating Tango-77 from the medium or 
25 the host cell. 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 
one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which 
30 Tango-77 -coding sequences have been introduced. Such 
host cells can then be used to create non-human 
transgenic animals in which exogenous Tango-77 sequences 
have been introduced into their genome or homologous 
recombinant animals in which endogenous Tango-77 
35 sequences have been altered. Such animals are useful for 
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studying the function and/or activity of Tango- 77 and for 
identifying and/or evaluating modulators of Tango-77 
activity. As used herein, a "transgenic animal" is a 
non-human animal, preferably a mammal, more preferably a 
5 rodent such as a rat or mouse, in which one or more of 
the cells of the animal includes a transgene. Other 
examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, 
etc. A transgene is exogenous DNA which is integrated 
10 into the genome of a cell from which a transgenic animal 
develops and which remains in the genome of the mature 
animal, thereby directing the expression of an encoded 
gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous 
is recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous 
Tango- 77 gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 
20 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal . 

A transgenic animal of the invention can be 
created by introducing Tango-77-encoding nucleic acid 
into the male pronuclei of a fertilized oocyte, e.g., by 
25 microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. The Tango-77 cDNA sequence e.g., that of (SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO:6; SEQ ID NO:10 or the cDNA 
of ATCC 98807) can be introduced as a transgene into the 
30 genome of a non-human animal. Alternatively, a nonhuman 
homologue of the human Tango-77 gene, such as a mouse 
Tango-77 gene, can be isolated based on hybridization to 
the human Tango-77 cDNA and used as a transgene. 
Intronic sequences and polyadenylation signals can also 
35 be included in the transgene to increase the efficiency 
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of expression of the transgene. A tissue- specific 
regulatory sequence (s) can be operably linked to the 
Tango- 77 transgene to direct expression of Tango- 77 
protein to particular cells. Methods for generating 
5 transgenic animals via embryo manipulation and 

microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 
10 Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the Tango- 77 transgene in its genome and/or expression 
15 of Tango-77 mRNA in tissues or cells of the animals. A 
transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding Tango-77 
can further be bred to other transgenic animals carrying 
20 other transgenes. 

To create an homologous recombinant animal , a 
vector is prepared which contains at least a portion of a 
Tango-77 gene (e.g., a human or a non-human homolog of 
the Tango-77 gene, e.g., a murine Tango-77 gene) into 
25 which a deletion, addition or substitution has been 

introduced to thereby alter, e.g., functionally disrupt, 
the Tango-77 gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the 
endogenous Tango-77 gene is functionally disrupted (i.e., 
30 no longer encodes a functional protein; also referred to 
as a "knock out" vector) . Alternatively, the vector 

can be designed such that, upon homologous recombination, 
the endogenous Tango-77 gene is mutated or otherwise 
altered but still encodes functional protein (e.g., the 
35 upstream regulatory region can be altered to thereby 
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alter the expression of the endogenous Tango-77 protein) . 
In the homologous recombination vector, the altered 
portion of the Tango-77 gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the Tango-77 gene to 
5 allow for homologous recombination to occur between the 
exogenous Tango-77 gene carried by the vector and an 
endogenous Tango-77 gene in an embryonic stem cell. The 
additional flanking Tango-77 nucleic acid is of 
sufficient length for successful homologous recombination 
10 with the endogenous gene. Typically, several kilobases 
of flanking DNA (both at the 5' and 3' ends) are included 
in the vector (see, e.g., Thomas and Capecchi (1987) Cell 
51:503 for a description of homologous recombination 
vectors) . The vector is introduced into an embryonic 
is stem cell line (e.g., by electroporation) and cells in 
which the introduced Tango-77 gene has homologously 
recombined with the endogenous Tango-77 gene are selected 
(see, e.g., Li et al . (1992) Cell 69:915). The selected 
cells are then injected into a blastocyst of an animal 
20 (e.g., a mouse) to form aggregation chimeras (see, e.g., 
Bradley in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed. (IRL, Oxford, 1987) 
pp. 113-152) . A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and 
25 the embryo brought to term. Progeny harboring the 

homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 
30 homologous recombination vectors and homologous 

recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-human 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
5 system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
(1992) Proc. Natl. Acad. Sci . USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (0' Gorman et al . 
10 (1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 
animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such 
animals can be provided through the construction of 
is "double" transgenic animals, e.g., by mating two 

transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene 
encoding a recombinase . 

Clones of the non-human transgenic animals 
20 described herein can also be produced according to the 

methods described in Wilmut et al . (1997) Nature 385:810- 
813 and PCT Publication Nos . WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 
25 growth cycle and enter G 0 phase. The quiescent cell can 
then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 
30 develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated . 
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IV. Pharmaceutical Compositions 

The Tango-77 nucleic acid molecules, Tango-77 
proteins, and anti-Tango-77 antibodies (also referred to 
herein as "active compounds") of the invention can be 
5 incorporated into pharmaceutical compositions suitable 
for administration. Such compositions typically comprise 
the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is 
10 intended to include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use 
of such media and agents for pharmaceutically active 
is substances is. well known in the art. Except insofar as 
any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 
20 A pharmaceutical composition of the invention is 

formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, (e.g. intravenous, intradermal, 
subcutaneous) (e.g., oral inhalation), transdermal 
25 (topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
30 glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
35 acetates, citrates or phosphates and agents for the 
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adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
5 syringes or multiple dose vials made of glass or plastic. 
Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable 
10 solutions or dispersions. For intravenous 

administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF; 
Parsippany, NJ) or phosphate buffered saline (PBS) . In 
all cases, the composition must be sterile and should be 
15 fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, 
20 for example, water, ethanol, polyol (for example, 

glycerol, propylene glycol, and liquid polyetheylene 
glycol, and the like), and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by 
the use of a coating such as lecithin, by the maintenance 
25 of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, 
parabens, chlorobutanol , phenol, ascorbic acid, 
30 thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, 
sugars, polyalcohols such as mannitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including 



ENSDOCID: <WO_9906426A1 J_> 



WO 99/06426 



PCTAJS98/16102 



- 57 - 

in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a Tango-77 
protein or anti-Tango-77 antibody) in the required amount 
in an appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a 
sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and 
f reeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 
gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 
5 for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 
o the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 
5 lubricant such as magnesium stearate or Sterotes; a 
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glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring . 

5 For administration by inhalation, the compounds 

are delivered in the form of an aerosol spray from a 
pressurized container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebul i zer . 
io Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation* 
Such penetrants are generally known in the art, and 
15 include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 
20 formulated into ointments, salves, gels, or creams as 
generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
25 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 
30 microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 
vinyl acetate , polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters , and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
35 those skilled in the art. The materials can also be 
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obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutical ly acceptable carriers. These can be 

prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4 , 522 , 811 . 

It is especially advantageous to formulate oral or 

10 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 

15 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 

20 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors . 

25 Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
(see, e.g., Chen et al . (1994) Proc . Natl. Acad. Sci. USA 
91:3054-3057). The pharmaceutical preparation of the 

30 gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 

35 retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery- 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
10 assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). A 
is Tango-77 protein interacts with other cellular proteins 
and can thus be used for regulation of inflammation. The 
polypeptides of the invention can be used in assays to 
determine biological activity. For example, they could 
be used in a panel of proteins for high- throughput 
20 screening. 

The isolated nucleic acid molecules of the 
invention can be used to express Tango-77 protein (e.g., 
via a recombinant expression vector in a host cell in 
gene therapy applications) , to detect Tango-77 mRNA 
25 (e.g., in a biological sample) or a genetic lesion in a 
Tango-77 gene, and to modulate Tango-77 activity. In 
addition, the Tango-77 proteins can be used to screen 
drugs or compounds which modulate the Tango-77 activity 
or expression as well as to treat disorders characterized 
30 by insufficient or excessive production of Tango-77 
protein or production of Tango-77 protein forms which 
have decreased or aberrant activity compared to Tango-77 
wild type protein. In addition, the anti -Tango- 77 
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antibodies of the invention can be used to detect and 
isolate Tango-77 proteins and modulate Tango-77 activity. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
5 uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to 

herein as a "screening assay") for identifying 

modulators, i.e., candidate or test compounds or agents 
10 (e.g., peptides, pept idomimetics , small molecules or 

other drugs) which bind to Tango-77 proteins or have a 

stimulatory or inhibitory effect on, for example, 

Tango-77 expression or Tango-77 activity. 

Examples of methods for the synthesis of molecular 
is libraries can be found in the art, for example in: 

DeWitt et al . (1993) Proc . Natl. Acad. Sci . USA 90:6909; 

Erb et al . (1994) Proc. Natl. Acad. Sci. USA 91:11422; 

Zuckermann et al . (1994). J. Med. Chem. 37:2678; Cho et 

al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
20 Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 

Chem. Int. Ed. Engl. 33:2061; and Gallop et al . (1994) J . 

Med. Chem. 37:1233. 

Libraries of compounds may be presented in 

solution (e.g., Houghten (1992) Bio/Techniques 13:412- 
25 421), or on beads (Lam (1991) Nature 354:82-84), chips 
(Fodor (1993) Nature 364:555-556), bacteria (U.S. Patent 

No. 5,223,409), spores (Patent Nos . 5,571,698; 5,403,484; 

and 5,223,409), plasmids (Cull et al . (1992) Proc . Natl. 

Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
30 (1990) Science 249:386-390; Devlin (1990) Science 

249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
USA 87:6378-6382; and Felici (1991) J. Mol . Biol. 

222 : 301-310) . 
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In another embodiment, an assay is used to 
determine the ability of the test compound to modulate 
the activity of Tango-77 or a biologically active portion 
thereof, for example, by determining the ability of the 
Tango-77 protein to bind to or interact with a Tango-77 
target molecule. As used herein, a "target molecule" is 
a molecule with which a Tango-77 protein binds or 
interacts in nature, for example, a molecule on the 
surface of a cell. A Tango-77 target molecule can be a 
non-Tango-77 molecule or a Tango-77 protein or 
polypeptide of the present invention. In one embodiment, 
a Tango-77 target molecule is a component of a signal 
transduction pathway, for example, Tango-77 may bind to a 
IL-1 receptor or another receptor thereby blocking the 
receptor and inhibiting future signal transduction. 
Determining the ability of the Tango-77 protein to bind 
to or interact with a Tango-77 target molecule can be 
accomplished by one of the methods described above. In a 
preferred embodiment, determining the ability of the 
o Tango-77 protein to bind to or interact with a Tango-77 
target molecule can be accomplished by determining the 
activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the 
5 target (e.g., intracellular Ca 2+ , diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a Tango-77 -responsive 
regulatory element operably linked to a nucleic acid 
o encoding a detectable marker, e.g. lucif erase) , or 

detecting a cellular response, for example, inflammation. 

In yet another embodiment, an assay of the present 
invention is a cell -free assay comprising contacting a 
Tango-77 protein or biologically active portion thereof 
5 with a test compound and determining the ability of the 
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test compound to bind to the Tango-77 protein or 
biologically active portion thereof. Binding of the test 
compound to the Tango-77 protein can be determined either 
directly or indirectly as described above. In a 
5 preferred embodiment, the assay includes contacting the 
Tango-77 protein or biologically active portion thereof 
with a known compound which binds Tango-7 7 to form an 
assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test 
io compound to interact with a Tango-77 protein, wherein 

determining the ability of the test compound to interact 
with a Tango-77 protein comprises determining the ability 
of the test compound to preferentially bind to Tango-77 
or biologically active portion thereof as compared to the 
is known compound . 

In another embodiment, an assay is a cell -free 
assay comprising contacting Tango-77 protein or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
20 modulate (e.g., stimulate or inhibit) the activity of the 
Tango-77 protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of Tango-77 can be accomplished, for 
example, by determining the ability of the Tango-77 
25 protein to bind to a Tango-77 target molecule by one of 
the methods described above for determining direct 
binding. In an alternative embodiment, determining the 
ability of the test compound to modulate the activity of 
Tango-77 can be accomplished by determining the ability 
30 of the Tango-77 protein to further modulate a Tango-77 
target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
35 comprises contacting the Tango-77 protein or biologically 
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active portion thereof with a known compound which binds 
Tango-77 to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability 
of the test compound to interact with a Tango-77 protein, 
5 wherein determining the ability of the test compound to 
interact with a Tango-77 protein comprises determining 
the ability of the Tango-77 protein to preferentially 
bind to or modulate the activity of a Tango-77 target 
molecule . 

10 It is possible that membrane -bound forms of Tango- 

77 exist. The cell -free assays of the present invention 
are amenable to use of both the forms Tango-77. In the 
case of cell -free assays comprising a membrane -bound form 
of Tango-77, it may be desirable to utilize a 
is solubilizing agent such that the membrane -bound form of 
Tango-77 is maintained in solution. Examples of such 
solubilizing agents include non- ionic detergents such as 
n-octylglucoside , n-dodecylglucoside , n-dodecylmaltoside , 
octanoyl -N-methylglucamide , decanoyl -N-methylglucamide , 
20 Triton® X-100, Triton® X-114, Thesit®, 

Isotridecypoly (ethylene glycol ether)n, 3-[(3- 
cholamidopropyl) dimethylamminio] -1 -propane sulfonate 
(CHAPS) , 3- [ (3 -cholamidopropyl) dimethylamminio] - 2- 
hydroxy-1 -propane sulfonate (CHAPSO) , or N-dodecyl=N, N- 
25 dimethyl -3 -ammonio-1 -propane sulfonate. 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either Tango-77 or its target molecule to 
facilitate separation of complexed from uncomplexed forms 
30 of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to 
Tango-77, or interaction of Tango-77 with a target 
molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for 
35 containing the reactants. Examples of such vessels 
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include microtitre plates, test tubes, and micro- 
centrifuge tubes. In one embodiment, a fusion protein 
can be provided which adds a domain that allows one or 
both of the proteins to be bound to a matrix. For 
5 example, glutathione-S- transferase/ Tango- 77 fusion 
proteins or glutathione-S- transf erase/target fusion 
proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical Co.; St, Louis, MO) or glutathione 
derivatized microtitre plates, which are then combined 

10 with the test compound or the test compound and either 
the non-adsorbed target protein or Tango- 77 protein, and 
the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for 
salt and pH) . Following incubation, the beads or 

is microtitre plate wells are washed to remove any unbound 
components and complex formation is measured either 
directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the 
matrix, and the level of Tango-77 binding or activity 

20 determined using standard techniques. 

Other techniques for immobilizing proteins on 
matrices can also be used in the screening assays of the 
invention. For example, either Tango-77 or its target 
molecule can be immobilized utilizing conjugation of 

25 biotin and streptavidin . Biotinylated Tango-77 or target 
molecules can be prepared from biotin-NHS (N- hydroxy - 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL) , and immobilized in the wells of streptavidin-coated 

30 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with Tango-77 or target molecules but 
which do not interfere with binding of the Tango-77 
protein to its target molecule can be derivatized to the 
wells of the plate, and unbound target or Tango-77 
35 trapped in the wells by antibody conjugation. Methods 
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for detecting such complexes, in addition to those 
described above for the GST- immobilized complexes, 
include immunodetection of complexes using antibodies 
reactive with the Tango-77 or target molecule, as well as 
5 enzyme- linked assays which rely on detecting an enzymatic 
activity associated with the Tango-77 or target molecule. 

In another embodiment, modulators of Tango-77 
expression are identified in a method in which a cell is 
contacted with a candidate compound and the expression of 
10 Tango-77 mRNA or protein in the cell is determined. The 
level of expression of Tango-77 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of Tango-77 mRNA or protein in the 
absence of the candidate compound. The candidate 
15 compound can then be identified as a modulator of 
Tango-77 expression based on this comparison. For 
example, when expression of Tango-77 mRNA or protein is 
greater (statistically significantly greater) in the 
presence of the candidate compound than in its absence, 
20 the candidate compound is identified as a stimulator of 
Tango-77 mRNA or protein expression. Alternatively, when 
expression of Tango-77 mRNA or protein is less 
(statistically significantly less) in the presence of the 
candidate compound than in its absence, the candidate 
25 compound is identified as an inhibitor of Tango-77 mRNA 
or protein expression. The level of Tango-77 mRNA or 
protein expression in the cells can be determined by 
methods described herein for detecting Tango-7 7 mRNA or 
protein. 

30 In yet another aspect of the invention, the 

Tango-77 proteins can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 72:223- 
232; Madura et al . (1993) J . Biol. Chew. 268:12046-12054; 

35 Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 
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et al . (1993) Oncogene 8:1693-1696; and PCT Publication 
No. WO 94/10300) , to identify other proteins, which bind 
to or interact with Tango-77 ( "Tango-77 -binding proteins" 
or "Tango-77-bp" ) and modulate Tango-77 activity. Such 
5 Tango-77 -binding proteins are also likely to be involved 
in the propagation of signals by the Tango-77 proteins 
as, for example, upstream or downstream elements of the 
Tango-77 pathway. 

The two-hybrid system is based on the modular 
10 nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
the assay utilizes two different DNA constructs. In one 
construct, the gene that codes for Tango-77 is fused to a 
gene encoding the DNA binding domain of a known 
15 transcription factor (e.g., GAL- 4) . In the other 
construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" 
or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If 
20 the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an Tango- 77 -dependent complex, the DNA- 
binding and activation domains of the transcription 
factor are brought into close proximity. This proximity 
allows transcription of a reporter gene (e.g., LacZ) 
25 which is operably linked to a transcriptional regulatory 
site responsive to the transcription factor. Expression 
of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes 
30 the protein which interacts with Tango-77. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 



NSDOCID: <WO 990S426A1 J_> 



WO 99/06426 



PCT/US98/16102 



- 68 - 
B . Detection Assays 

Portions or fragments of the cDNA sequence 
identified herein (and the corresponding complete gene 
sequences) can be used in numerous ways as polynucleotide 
5 reagents. For example, the sequence can be used to: (i) 
map the respective gene on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 
(tissue typing) ; and (iii) aid in forensic identification 
10 of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
of a gene has been isolated, this sequence can be used to 
15 map the location of the gene on a chromosome. 

Accordingly, Tango-77 nucleic acid molecules described 
herein or fragments thereof, can be used to map the 
location of the Tango-77 gene(s) on a chromosome. The 
mapping of the Tango-77 sequences to chromosomes is an 
20 important first step in correlating these sequences with 
genes associated with disease. 

Briefly, a Tango-77 gene can be mapped to 
chromosomes by preparing PCR primers (preferably 15-2 5 bp 
in length) from the Tango-77 sequences. Computer 
25 analysis of Tango-77 sequences can be used to rapidly 

select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. 
These primers can then be used for PCR screening of 
somatic cell hybrids containing individual human 
30 chromosomes. Only those hybrids containing the human 
gene corresponding to the Tango-77 sequences will yield 
an amplified fragment. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 
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mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 
random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow (because they lack 
5 a particular enzyme) but in which human cells can, the 
one human chromosome that contains the gene encoding the 
needed enzyme, will be retained. By using various media, 
panels of hybrid cell lines can be established. Each 
cell line in a panel contains either a single human 

10 chromosome or a small number of human chromosomes, and a 
full set of mouse chromosomes, allowing easy mapping of 
individual genes to specific human chromosomes. 
(D'Eustachio et al . (1983) Science 220:919-924). Somatic 
cell hybrids containing only fragments of human 

15 chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid 
procedure for assigning a particular sequence to a 
particular chromosome. Three or more sequences can be 

20 assigned per day using a single thermal cycler. Using 
the Tango-77 sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies 
which can similarly be used to map a Tango-77 sequence to 

25 its chromosome include in situ hybridization (described 
in Fan et al . (1990) Proc. Natl. Acad. Sci . USA 87:6223- 
27) , pre -screening with labeled flow- sorted chromosomes, 
and pre-selection by hybridization to chromosome specific 
cDNA libraries. 

30 Fluorescence in situ hybridization (FISH) of a DNA 

sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical, 

35 e.g., colcemid that disrupts the mitotic spindle. The 
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chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 
develops on each chromosome, so that the chromosomes can 
be identified individually. The FISH technique can be 
5 used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. 
Preferably 1,000 bases, and more preferably 2,000 bases 
10 will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988) ) . 

Reagents for chromosome mapping can be used 
15 individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
20 sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
25 sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
30 same chromosomal region, can then be identified through 
linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Nature 
325 : 783-787 . 

Moreover, differences in the DNA sequences between 
35 individuals affected and unaffected with a disease 
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associated with the Tango- 77 gene can be determined. If 
a mutation is observed in some or all of the affected 
individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the 
5 particular disease. Comparison of affected and 

unaffected individuals generally involves first looking 
for structural alterations in the chromosomes such as 
deletions or translocations that are visible from 
chromosome spreads or detectable using PCR based on that 
10 DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the 
presence of a mutation and to distinguish mutations from 
polymorphisms . 

2 . Tissue Typing 

15 The Tango-77 sequences of the present invention 

can also be used to identify individuals from minute 
biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 

20 personnel. In this technique, an individual's genomic 

DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 

25 switched, or stolen, making positive identification 

difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 
U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 

30 invention can be used to provide an alternative technique 
which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
Tango-77 sequences described herein can be used to 
prepare two PCR primers from the 5' and 3' ends of the 
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sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 
individuals, prepared in this manner, can provide unique 
5 individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
individuals and from tissue. The Tango-77 sequences of 
io the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the 
coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a 
is frequency of about once per each 500 bases. Each of the 
sequences described herein can, to some degree, be used 
as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, 
20 fewer sequences are necessary to differentiate 

individuals. The noncoding sequences of SEQ ID N0:1 can 
comfortably provide positive individual identification 
with a panel of perhaps 10 to 1,000 primers which each 
yield a noncoding amplified sequence of 100 bases. If 
25 predicted coding sequences, such as those in SEQ ID NO: 3, 
SEQ ID NO : 6 , or SEQ ID NO: 10 are used, a more appropriate 
number of primers for positive individual identification 
would be 500-2,000. 

If a panel of reagents from Tango-77 sequences 
30 described herein is used to generate a unique 

identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 
positive identification of the individual, living or 
35 dead, can be made from extremely small tissue samples. 
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2. Use of Partial Tango-77 Sequences in Forensic 

Biology 

DNA-based identification techniques can also be 
used in forensic biology. Forensic biology is a 
5 scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 
make such an identification, PCR technology can be used 
to amplify DNA sequences taken from very small biological 
io samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
standard, thereby allowing identification of the origin 
of the biological sample. 
15 The sequences of the present invention can be used 

to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
identifications by, for example, providing another 
20 "identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 
25 Sequences targeted to noncoding regions of SEQ ID NO;l 
are particularly appropriate for this use as greater 
numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this 
technique. Examples of polynucleotide reagents include 
30 the Tango-77 sequences or portions thereof, e.g., 

fragments derived from the noncoding regions of SEQ ID 
NO:l having a length of at least 2 0 or 3 0 bases. 

The Tango-77 sequences described herein can 
further be used to provide polynucleotide reagents, e.g., 
35 labeled or labelable probes which can be used in, for 
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example, an in situ hybridization technique, to identify 
a specific tissue, e.g., brain tissue. This can be very 
useful in cases where a forensic pathologist is presented 
with a tissue of unknown origin. Panels of such Tango-77 
5 probes can be used to identify tissue by species and/or 
by organ type . 

In a similar fashion, these reagents, e.g., 
Tango-77 primers or probes can be used to screen tissue 
culture for contamination (i.e., screen for the presence 
10 of a mixture of different types of cells in a culture) . 

C . Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics , and monitoring 
is clinical trails are used for prognostic (predictive) 

purposes to thereby treat an individual prophylact ically . 
Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining Tango-7 7 protein 
and/or nucleic acid expression as well as Tango-77 
20 activity, in the context of a biological sample (e.g., 

blood, serum, cells, tissue) to thereby determine whether 
an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with 
aberrant Tango-77 expression or activity. The invention 
25 also provides for prognostic (or predictive) assays for 
determining whether an individual is at risk of 
developing a disorder associated with Tango-77 protein, 
nucleic acid expression or activity. For example, 
mutations in a Tango-77 gene can be assayed in a 
30 biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with 
Tango-77 protein, nucleic acid expression or activity. 
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Another aspect of the invention provides methods 
for determining Tango-77 protein, nucleic acid expression 
or Tango-77 activity in an individual to thereby select 
appropriate therapeutic or prophylactic agents for that 
5 individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 
io determine the ability of the individual to respond to a 
particular agent . ) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of Tango-77 in 

is clinical trials. 

These and other agents are described in further 
detail in the following sections. 

1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
20 absence of Tango-77 in a biological sample involves 
obtaining a biological sample from a test subject and 
contacting the biological sample with a compound or an 
agent capable of detecting Tango-77 protein or nucleic 
acid (e.g., mRNA, genomic DNA) that encodes Tango-77 
25 protein such that the presence of Tango-77 is detected in 
the biological sample. A preferred agent for detecting 
Tango-77 mRNA or genomic DNA is a labeled nucleic acid 
probe capable of hybridizing to Tango-77 mRNA or genomic 
DNA. The nucleic acid probe can be, for example, a full- 
30 length Tango-77 nucleic acid, such as the nucleic acid of 
SEQ ID NO: 1, SEQ ID NO : 3 SEQ ID NO : 6 , SEQ ID NO: 10 or a 
portion thereof, such as an oligonucleotide of at least 
15, 30, 50, 100, 250 or 500 nucleotides in length and 
sufficient to specifically hybridize under stringent 
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conditions to Tango-77 mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the 
invention are described herein. 

A preferred agent for detecting Tango-77 protein 
5 is an antibody capable of binding to Tango-77 protein, 
preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof 
(e.g., Fab or F(ab') a ) can be used. The term "labeled", 
o with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with 
^5 another reagent that is directly labeled. Examples of 

indirect labeling include detection of a primary antibody 
using a f luorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be 
detected with f luorescently labeled streptavidin. The 
term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as 
well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention 
can be used to detect Tango-77 mRNA, protein, or genomic 
25 DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of 
Tango-77 mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of 
Tango-77 protein include enzyme linked immunosorbent 
30 assays (ELISAs) , Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of 
Tango-77 genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of Tango-77 
protein include introducing into a subject a labeled 
, 5 anti-Tango-77 antibody. For example, the antibody can be 



20 
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labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging 
techniques . 

In one embodiment, the biological sample contains 
5 protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 
10 from a subject. 

In another embodiment, the methods further involve 
obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting Tango-77 protein, mRNA, or 
is genomic DNA, such that the presence of Tango-77 protein, 
mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of Tango-77 protein, mRNA or 
genomic DNA in the control sample with the presence of 
Tango-77 protein, mRNA or genomic DNA in the test sample. 
20 The invention also encompasses kits for detecting 

the presence of Tango-77 in a biological sample (a test 
sample) . Such kits can be used to determine if a subject 
is suffering from or is at increased risk of developing a 
disorder associated with aberrant expression of Tango-77 
25 (e.g., an immunological disorder) . For example, the kit 
can comprise a labeled compound or agent capable of 
detecting Tango-77 protein or mRNA in a biological sample 
and means for determining the amount of Tango-77 in the 
sample (e.g., an anti -Tango-77 antibody or an 
30 oligonucleotide probe which binds to DNA encoding 

Tango-77, e.g., SEQ ID NO : 1 or SEQ ID NO:3 or SEQ ID 
N0:6, or SEQ ID NO:10). Kits may also include 
instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
35 associated with aberrant expression of Tango-77 if the 
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amount of Tango- 77 protein or mRNA is above or below a 
normal level . 

For antibody-based kits, the kit may comprise, for 
example: (1) a first antibody (e.g., attached to a solid 
5 support) which binds to Tango- 77 protein; and, optionally 
(2) a second, different antibody which binds to Tango-77 
protein or the first" antibody and is conjugated to a 
detectable agent. 

For oligonucleotide-based kits, the kit may 
10 comprise, for example: (1) an oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a Tango-77 nucleic acid sequence or (2) a pair of primers 
useful for amplifying a Tango-77 nucleic acid molecule; 
The kit may also comprise, e.g., a buffering 
is agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 
substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
20 compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 
single package along with instructions for observing 
whether the tested subject is suffering from or is at 
25 risk of developing a disorder associated with aberrant 
expression of Tango-77. 

2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
30 subjects having or at risk of developing a disease or 

disorder associated with aberrant Tango-77 expression or 
activity. For example, the assays described herein, such 
as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or 
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at risk of developing a disorder associated with aberrant 
expression or activity. Thus, the present invention 
provides a method in which a test sample is obtained from 
a subject and Tango-77 protein or nucleic acid (e.g., 

5 mRNA, genomic DNA) is detected, wherein the presence of 
Tango-77 protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. As used herein, a "test sample" refers to a 

10 biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described 
herein can be used to determine whether a subject can be 

is administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant Tango-77 expression or 
activity. For example, such methods can be used to 

20 determine whether a subject can be effectively treated 

with a specific agent or class of agents (e.g., agents of 
a type which decrease Tango-77 activity) . Thus, the 
present invention provides methods for determining 
whether a subject can be effectively treated with an 

25 agent for a disorder associated with aberrant Tango-77 

expression or activity in which a test sample is obtained 
and Tango-77 protein or nucleic acid is detected (e.g., 
wherein the presence of Tango-77 protein or nucleic acid 
is diagnostic for a subject that can be administered the 

30 agent to treat a disorder associated with aberrant 
Tango-77 expression or activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a Tango-77 gene, 
thereby determining if a subject with the lesioned gene 
35 is at risk for a disorder characterized by aberrant 
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inflammation. In preferred embodiments, the methods 
include detecting, in a sample of cells from the subject, 
the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting 
5 the integrity of a gene encoding a Tango- 77 -protein, or 
the mis-expression of the Tango-77 gene. For example, 
such genetic lesions or mutations can be detected by 
ascertaining the existence of at least one of: 1) a 
deletion of one or more nucleotides from a Tango-77 gene; 
10 2) an addition of one or more nucleotides to a Tango-77 
gene; 3) a substitution of one or more nucleotides of a 
Tango-77 gene; 4) a chromosomal rearrangement of a 
Tango-77 gene; 5) an alteration in the level of a 
messenger RNA transcript of a Tango-77 gene; 6) an 
is aberrant modification of a Tango-77 gene, such as of the 
methylation pattern of the genomic DNA; 7) the presence 
of a non-wild type splicing pattern of a messenger RNA 
transcript of a Tango-77 gene; 8) a non-wild type level 
of a Tango-77-protein; 9) an allelic loss of a Tango-77 
20 gene, and 10) an inappropriate post- translational 

modification of a Tango-77-protein . As described herein, 
there are a large number of assay techniques known in the 
art which can be used for detecting lesions or mutations 
in a Tango-77 gene. A preferred biological sample is a 
25 peripheral blood leukocyte sample isolated by 
conventional means from a subject. 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos . 4,683,195 and 
30 4,683,202), such as anchor PCR or RACE PCR, or, 

alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al . (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc . Natl. Acad. Sci • USA 91:360- 
364) , the latter of which can be particularly useful for 
35 detecting point mutations in the Tango-77-gene (see, 
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e.g., Abravaya et al . (1995) Nucleic Acids Res. 23:675- 
682) . This method can include the steps of collecting a 
sample of cells from a patient, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the 
5 sample, contacting the nucleic acid sample with one or 
more primers which specifically hybridize to a Tango-77 
gene under conditions such that hybridization and 
amplification of the Tango-77-gene (if present) occurs, 
and detecting the presence or absence of an amplification 
10 product, or detecting the size of the amplification 

product and comparing the length to a control sample. It 
is anticipated that PCR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 
is described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
20 Acad. Sci. USA 86:1173-1177) , Q-Beta Replicase (Lizardi 
et al. (1988) Bio /Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 
25 schemes are especially useful for the detection of 

nucleic acid molecules if such molecules are present in 
very low numbers . 

In an alternative embodiment, mutations in a 
Tango-77 gene from a sample cell can be identified by 
30 alterations in restriction enzyme cleavage patterns. For 
example, sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonuc leases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 
35 fragment length sizes between sample and control DNA 
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indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,498,531) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme 
5 cleavage site . 

In other embodiments, genetic mutations in 
Tango- 77 can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of 
10 oligonucleotides probes (Cronin et al . (1996) Human 

Mutation 7:244-255; Kozal et al . (1996) Nature Medicine 
2:753-759). For example, genetic mutations in Tango-77 
can be identified in two-dimensional arrays containing 
light -generated DNA probes as described in Cronin et al . 
is supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the 
sequences by making linear arrays of sequential 
overlapping probes. This step allows the identification 
20 of point mutations. This step is followed by a second 
hybridization array that allows the characterization of 
specific mutations by using smaller, specialized probe 
arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel 
25 probe sets, one complementary to the wild- type gene and 
the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the Tango-77 gene and detect mutations 
30 by comparing the sequence of the sample Tango-77 with the 
corresponding wild- type (control) sequence. Examples of 
sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc . Natl. Acad. 
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. 
35 Sci. USA 74:5463) . It is also contemplated that any of a 
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variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ( (1995) 
Bio/Techniques 19:448), including sequencing by mass 
spectrometry (see, e.g., PCT Publication No. WO 94/16101; 
5 Cohen et al . (1996) Adv. Chromatogr. 36:127-162; and 

Griffin et al . (1993) Appl . Biochem. Biotechnol. 38:147- 
159) . 

Other methods for detecting mutations in the 
Tango-77 gene include methods in which protection from 

io cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242). In general, the technique of 
"mismatch cleavage" entails providing heteroduplexes 
formed by hybridizing (labeled) RNA or DNA containing the 

15 wild-type Tango-77 sequence with potentially mutant RNA 
or DNA obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which cleaves 
single- stranded regions of the duplex such as which will 
exist due to basepair mismatches between the control and 

20 sample strands. RNA/DNA duplexes can be treated with 
RNase to digest mismatched regions, and DNA/ DNA hybrids 
can be treated with SI nuclease to digest mismatched 
regions. In other embodiments, either DNA /DNA or RNA/DNA 
duplexes can be treated with hydroxylamine or osmium 

25 tetroxide and with piperidine in order to digest 

mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size 
on denaturing polyacrylamide gels to determine the site 
of mutation. See, e.g., Cotton et al . (1988) Proc . Natl. 

30 Acad. Sci. USA 85:4397; Saleeba et al . (1992) Methods 
Enzymol. 217:286-295. In a preferred embodiment, the 
control DNA or RNA can be labeled for detection. 

In' still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
35 mismatched base pairs in double- stranded DNA (so called 
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"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in Tango-77 cDNAs 
obtained from samples of cells. For example, the mutY 
enzyme of E. coli cleaves A at G/A mismatches and the 
5 thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al . (1994) Carcinogenesis 15:1657- 
1662) . According to an exemplary embodiment, a probe 
based on a Tango-77 sequence, e.g., a wild-type Tango-77 
sequence, is hybridized to a cDNA or other DNA product 
10 from a test cell (s) . The duplex is treated with a DNA 
mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or 
the like. See, e.g., U.S. Patent No. 5,459,039. 
In other embodiments, alterations in 
is electrophoretic mobility will be used to identify 

mutations in Tango-77 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant 
and wild type nucleic acids (Orita et al . (1989) Proc. 
20 Natl. Acad. Sci. USA 86:2766; see also Cotton (1993) 

Mutat. Res. 285:125-144; Hayashi (1992) Genet Anal Tech 
Appl 9:73-79). Single- stranded DNA fragments of sample 
and control Tango-77 nucleic acids will be denatured and 
allowed to renature. The secondary structure of single- 
25 stranded nucleic acids varies according to sequence, and 
the resulting alteration in .electrophoretic mobility 
enables the detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by 
30 using RNA (rather than DNA) , in which the secondary 

structure is more sensitive to a change in sequence . In 
a preferred embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded 
heteroduplex molecules on the basis of changes in 
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electrophoretic mobility (Keen et al . (1991) Trends Genet 
7:5). 

In yet another embodiment, the movement of mutant 
or wild-type fragments in polyacryl amide gels containing 
5 a gradient of denaturant is assayed using denaturing 

gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 

10 approximately 40 bp of high-melting GC-rich DNA by PCR . 
In a further embodiment , a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys . Chem. 265:12753). 

is Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

20 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc . Natl. Acad. 
Sci . USA 86:6230). Such allele specific oligonucleotides 

25 are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification 

30 technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 
35 differential hybridization) (Gibbs et al . (1989) Nucleic 
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Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238) . In addition, it may be desirable to 
5 introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al . (1992) Mol. Cell Probes 6:1) . It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

10 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

is amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

20 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a Tango- 77 gene. 

Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which Tango- 77 is 

25 expressed may be utilized in the prognostic assays 
described herein. 

3 . Pharmac ogenomi c s 

Agents, or modulators which have a stimulatory or 
30 inhibitory effect on Tango-77 activity (e.g., Tango-77 
gene expression) as identified by a screening assay 
described herein can be administered to individuals to 
treat (prophylactically or therapeutically) disorders 
(e.g., acute or chronic inflammation and asthma) 
35 associated with aberrant Tango-77 activity. In 
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conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 
5 considered. Differences in metabolism of therapeutics 
can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 
10 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens, 
is Accordingly, the activity of Tango- 77 protein, expression 
of Tango-77 nucleic acid, or mutation content of Tango-77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. 
20 Pharmacogenomics deals with clinically significant 

hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 
43 (2) : 254-266 . In general, two types of pharmacogenetic 
25 conditions can be differentiated. Genetic conditions 

transmitted as a single factor altering the way drugs act 
on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering 
the way the body acts on drugs are referred to as 
30 "altered drug metabolism" . These pharmacogenetic 
conditions can occur either as rare defects or as 
polymorphisms. For example, glucose -6 -phosphate 
dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is 
35 haemolysis after ingestion of oxidant drugs (anti- 
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malarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 
5 the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransf erase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 
10 effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 
15 different among different populations. For example, the 
gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 
20 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, PM shows no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6-f ormed metabolite morphine. The other 
25 extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of Tango-77 protein, expression 
30 of Tango-77 nucleic acid, or mutation content of Tango-77 
genes in an individual can be determined to thereby 
select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 
35 of polymorphic alleles encoding drug-metabolizing enzymes 
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to the identification of an individual's drug 
responsiveness phenotype . This knowledge , when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
5 prophylactic efficiency when treating a subject with a 
Tango-77 modulator, such as a modulator identified by one 
of the exemplary screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 

10 compounds) on the expression or activity of Tango-77 

(e.g., the ability to modulate aberrant inflammation) can 
be applied not only in basic drug screening, but also in 
clinical trials. For example, the effectiveness of an 
agent, as determined by a screening assay as described 

is herein, to increase Tango-77 gene expression, increase 
protein levels, or upregulate Tango-77 activity, can be 
monitored in clinical trials of subjects exhibiting 
decreased Tango-77 gene expression, decreased protein 
levels, or downregulated Tango-77 activity. 

20 Alternatively, the effectiveness of an agent, as 

determined by a screening assay, to decrease Tango-77 
gene expression, decrease protein levels, or downregulate 
Tango-77 activity, can be monitored in clinical trials of 
subjects exhibiting increased Tango-77 gene expression, 

25 increased protein levels, or upregulated Tango-77 
activity. 

For example, and not by way of limitation, genes, 
including Tango-77, that are modulated in cells by 
treatment with an agent (e.g., compound, drug or small 
30 molecule) which modulates Tango-77 activity (e.g., as 

identified in a screening assay described herein) can be 
identified. Thus, to study the effect of agents on 
cellular proliferation disorders, for example, in a 
clinical trial, cells can be isolated and RNA prepared 
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and analyzed for the levels of expression of Tango- 77 and 
other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
5 described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 
described herein, or by measuring the levels of activity 
of Tango-77 or other genes. In this way, the gene 
expression pattern can serve as a marker, indicative of 
10 the physiological response of the cells to the agent. 
Accordingly, this response state may be determined 
before, and at various points during, treatment of the 
individual with the agent . 

In a preferred embodiment, the present invention 
is provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 
acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
20 steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a Tango-77 protein, 
mRNA, or genomic DNA in the preadministrat ion sample ,- 
(iii) obtaining one or more post -administration samples 
25 from the subject; (iv) detecting the level of expression 
or activity of the Tango-77 protein, mRNA, or genomic DNA 
in the post-administration samples; (v) comparing the 
level of expression or activity of the Tango-77 protein, 
mRNA, or genomic DNA in the pre-administration sample 
30 with the Tango-77 protein, mRNA, or genomic DNA in the 
post administration sample or samples; and (vi) altering 
the administration of the agent to the subject 
accordingly. For example, increased administration of 
the agent may be desirable to increase the expression or 
35 activity of Tango-77 to higher levels than detected, 
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i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may 
be desirable to decrease expression or activity of 
Tango-77 to lower levels than detected, i.e., to decrease 
5 the effectiveness of the agent. 

C . Methods of Treatment 
The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) developing or 

10 having a disorder associated with aberrant Tango-77 
expression or activity. Alternatively, disorders 
associated with aberrant IL-1 production can be treated 
with Tango-77. Such disorders include acute and chronic 
inflammation,, asthma, some classes of arthritis, 

is autoimmune diabetes, systemic lupus erythematosus and 
inflammatory bowel disease. 

1 . Prophylactic Methods 
In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 

20 associated with an aberrant Tango-77 expression or 

activity (or aberrant IL-1 expression or activity) , by 
administering to the subject an agent which modulates 
Tango-77 expression or at least one Tango-77 activity. 
Subjects at risk for a disease which is caused or 

25 contributed to by aberrant Tango-77 expression or 

activity can be identified by, for example, any or a 
combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent 
can occur prior to the manifestation of symptoms 

30 characteristic of the Tango-77 aberrancy, such that a 
disease or disorder is prevented or, alternatively, 
delayed in its progression. Depending on the type of 
Tango-77 aberrancy, for example, a Tango-77 agonist or 
Tango-77 antagonist agent can be used for treating the 
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subject. The appropriate agent can be determined based 
on screening assays described herein. 

2 . Therapeutic Methods 

Another aspect of the invention pertains to 
5 methods of modulating Tango-77 expression or activity for 
therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that 
modulates one or more of the activities of Tango-77 
protein activity associated with the cell. An agent that 
10 modulates Tango-77 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of a Tango-77 protein, 
a peptide, a Tango-77 peptidomimetic , or other small 
molecule. In one embodiment, the agent stimulates one or 
is more of the biological activities of Tango-77 protein. 
Examples of such stimulatory agents include active 
Tango-7 7 protein and a nucleic acid molecule encoding 
Tango-77 that has been introduced into the cell. In 
another embodiment, the agent inhibits one or more of the 
20 biological activities of Tango-77 protein. Examples of 
such inhibitory agents include antisense Tango-77 nucleic 
acid molecules and anti-Tango-77 antibodies. These 
modulatory methods can be performed in vitro (e.g., by 
cultufing the cell with the agent) or, alternatively, in 
2 5 vivo (e.g, by administering the agent to a subject) . As 
such, the present invention provides methods of treating 
an individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of a 
Tango-77 protein or nucleic acid molecule. In one 
30 embodiment, the method involves administering an agent 

(e.g., an agent identified by a screening assay described 
herein), or combination of agents that modulates (e.g., 
upregulates or downregulates) Tango-77 expression or 
activity. In another embodiment, the method involves 
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administering a Tango- 77 protein or nucleic acid molecule 
as therapy to compensate for reduced or aberrant Tango- 7 7 
expression or activity. 

Stimulation of Tango-77 activity is desirable in 
5 situations in which Tango-77 is abnormally downregulated 
and/or in which increased Tango-77 activity is likely to 
have a beneficial effect. Conversely, inhibition of 
Tango-77 activity is desirable in situations in which 
Tango-7 7 is abnormally upregulated and/ or in which 

10 decreased Tango-77 activity is likely to have a 
beneficial effect . 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

15 published patent applications cited throughout this 
application are hereby incorporated by reference. 



EXAMPLES 

Example 1: Isolation and Characterization of Human 
Tango-77 cDNAs 

20 Cytokine genes IL-la, IL-1/3 and IL-lra have been 

found to be closely clustered on chromosome 2, i.e., 
IL-la, IL-1/3 and IL-lra are located within 450 kb of each 
other. BAC clones containing IL-la and IL-1/3 were used 
to identify other proximal unknown cytokine genes. To do 

25 this, a BAC clone containing IL-la and IL-10 was selected 
from a BAC library (Research Genetics, Huntsville, 
Alabama) using specific primers designed against IL-lar 
and IL-ljS. The DNA from the BAC was extracted and used 
to make a random- sheared genomic library. From this BAC 

30 library, 4000 clones were selected for sequencing. The 
resulting genomic sequences were then assembled into 
contigs and used to screen proprietary and public data 
bases. One genomic contig was found to contain two 
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segments of sequences which resemble IL-lra. These two 
segments are potential exons of Tango-77 gene. 

Two PCR primers were then designed from the two 
potential exons and used to screen a panel of cDNA 
5 libraries for the expression of a Tango-77 message. A 
cDNA library from TNF-a treated human lung epithelia 
showed a positive band of the predicted size (i.e., if 
the two exons are spliced together) . Using the PCR 
fragment as a probe, a single cDNA clone was isolated 
10 from the same library. This cDNA contains an insert of 
98 9 bp. The cDNA clone contains three possible open 
reading frames. The first open reading frame encompasses 
534 nucleotides (nucleotides 356-889 of SEQ ID NO:l; SEQ 
ID N0:3) and encodes a 178 amino acid protein (SEQ ID 
is NO: 2) . This protein may include a predicted signal 

sequence of about 63 amino acids (from amino acid 1 to 
about amino acid 63 of SEQ ID NO : 2 (SEQ ID NO: 4)) and a 
predicted mature protein of about 115 amino acids (from 
about amino acid 64 to amino acid 178 of SEQ ID NO : 2 (SEQ 
20 ID NO:5) ) . 

The second putative nucleotide open reading frame 
encompasses 498 nucleotides (nucleotides 389-889 of SEQ 
ID N0:1; SEQ ID NO: 6) and encodes a 167 amino acid 
protein (SEQ ID NO: 7), This protein includes a predicted 
25 signal sequence of about 52 amino acids (from amino acid 
1 to about amino acid 52 of SEQ ID NO:7 (SEQ ID NO:8)) 
and a predicted mature protein of about 115 amino acids 
(from about amino acid 53 to amino acid 167 of SEQ ID 
NO:7 (SEQ ID NO:9) ) . 
30 The third open reading frame (nucleotides 372-889 

of SEQ ID NO:l; SEQ ID NO: 10) encompasses 4 08 nucleotides 
and encodes a 136 amino acid protein (SEQ ID NO:ll). 
This protein includes a predicted signal sequence of 
about 21 amino acids (from amino acid 1 to about amino 
35 acid 21 of SEQ ID NO: 11 (SEQ ID N0:12)) and a predicted 
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mature protein of about 115 amino acids (from about amino 
acid 22 to amino acid 136 of SEQ ID NO: 11 (SEQ ID 
NO: 13) ) . 

Tango-77 is predicted to be 35% identical to human 
5 IL-lra at the amino acid level. 

Example 2: Expression of Tango-77 mRNA in Human Tissues 

The expression of Tango-77 was analyzed using 
Northern blot hybridization. A PCR generated 989 bp 
Tango-77 product was radioact ively labeled with 32 P-dCTP 

10 using the Prime- It kit (Stratagene; La Jolla, CA) 

according to the instructions of the supplier. Filters 
containing human mRNA (MTNI and MTNII: Clontech; Palo 
Alto, CA) were probed in ExpressHyb hybridization 
solution (Clontech) and washed at high stringency 

15 according to manufacturer's recommendations. 

Tango-77 mRNA was not detected in any unstimulated 
tissues (brain, liver, spleen, skeletal muscle, testis, 
pancreas, heart, kidney and peripheral blood leukocytes) 
mRNA on Clontech Northern blots. 

20 Over 96 cDNA libraries were then tested for the 

presence of Tango-77 using PCR amplification. Only three 
libraries displayed a positive signal. These libraries 
were the TNFof-treated bronchoepithelium, TNFce- treated SSC 
cell line and ant i-CD3 -treated T cells. 

25 Example 3: Characterization of Tango-77 Proteins 

In this example, the predicted amino acid sequence 
of human Tango-77 protein was compared to the amino acid 
sequence of known protein IL-lra. In addition, the 
molecular weight of the human Tango-77 proteins was 
30 predicted. 

The human Tango-77 cDNA (Figure 1; SEQ ID NO:l) 
isolated as described above encodes a 178 amino acid 
protein (Figure 1; SEQ ID NO: 2) or a 167 amino acid 
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protein (Figure 1; SEQ ID NO: 7) or a 136 amino acid 
protein (Figure 1; SEQ ID NO: 11). The signal peptide 
prediction program SIGNALP Optimized Tool (Nielsen et al . 
(1997) Protein Engineering 10:1-6) predicted that 
5 Tango-77 includes a 63 amino acid signal peptide (amino 
acid 1 to about amino acid 63 of SEQ ID NO: 2 (SEQ ID 
NO:4)) preceding the 115 mature protein; or preceding the 
115 mature protein (about amino acid 52 to amino acid 167 
of SEQ ID NO:7 (SEQ ID NO:8)); or preceding the 115 

10 mature protein (about amino acid 21 to amino acid 136 of 
SEQ ID NO: 11; SEQ ID NO:12). 

As shown in Figure 2, Tango-77 has a region of 
homology to IL-lra (SEQ ID NO:14). 

Mature Tango-77 has a predicted MW of about 13 kDa 

is and the predicted MW for the immature Tango-77 is 19.6 
kDa, 18.5 kDa or 15.2 kDa, not including post- 
translational modifications . 

Example 4: Preparation of Tango-77 Proteins 

Recombinant Tango-77 can be produced in a variety 

20 of expression systems. For example, the mature Tango-77 
peptide can be expressed as a recombinant glutathione- S- 
transferase (GST) fusion protein in E. coli and the 
fusion protein can be isolated and characterized. 
Specifically, as described above, Tango-77 can be fused 

25 to GST and this fusion protein can be expressed in E. 
coli strain PEB199. Expression of the GST-Tango-77 
fusion protein in PEB199 can be induced with IPTG. The 
recombinant fusion protein can be purified from crude 
bacterial lysates of the induced PEB199 strain by 

30 affinity chromatography on glutathione beads. 
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Example 5: Alternatively spliced forms of IL-lra and 
Tango- 77 

Computer program Procrustes (Gelfand et al . , 1996, 
Proc. Natl. Acad. Sci . USA, 93:9061-9066) is an alignment 
5 algorithm that predicts the presence of alternatively 
spliced exons for a protein of interest in a stretch of 
genomic DNA. Using the IL-lra sequence, Proscustes was 
used to search for the presence of additional sequences 
that might encode for alternatively spliced forms of IL- 

10 Ira in the two overlapping BAC genomic sequences (see 
Fig. 3 and Fig. 4) . Potential sequences that encode 
variant exons for IL-lra were identified. These 
predicted exons aligned well with the N-terminal region 
of IL-lra, but were not present in Tango-77. The results 

is from Procrustes predicts the existence of more spliced 
forms of IL-lra. 

Furthermore, Procrustes also predicted an 
additional sequence in BAC1 and BAC 2 that encodes an 
alternatively spliced exon for Tango-77 (T77-procrustes ; 

20 Fig. 5) . This predicted splice variant form of Tango-77, 
T77-procrustes , was aligned with Tango-77 (Fig. 6) and 
with IL-lra and IL-1/? (Fig. 7) . 

PCR primers within this sequence can be used to 
generate a product that can be used to screen a panel of 

25 cDNA libraries using standard techniques. Suitable cDNA 
libraries include libraries made from TNFar- treated 
bronchoepithelium, TNFcx- treated SSC cell line and anti- 
CD3 -treated T cells. The resulting cDNA clone (s) can be 
isolated from the library and sequenced to identify 

30 additional Tango-7 7 cDNAs. 
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Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to t t he specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid molecule comprising a 
5 nucleotide sequence which is at least 45% identical to 
the nucleotide sequence of SEQ ID NO:!, SEQ ID NO: 3, SEQ 
ID NO: 6, SEQ ID NO: 10, the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98 8 07, or a 
complement thereof ; 

io b) a nucleic acid molecule comprising a fragment 

of at least 300 nucleotides of the nucleotide sequence of 
SEQ ID N0:1, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID NO: 10, the 
cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807, or a complement thereof; 

is c) nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 

20 plasmid deposited with ATCC as Accession Number 98807; 

d) a nucleic acid molecule which encodes a 
fragment of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO:7, SEQ ID NO : 8 , SEQ ID NO : 9 , SEQ ID NO:ll, SEQ ID 

25 NO: 12, SEQ ID NO: 13, wherein the fragment comprises at 
least 15 contiguous amino acids of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO : 5 , SEQ ID NO: 7, SEQ ID NO : 8 , SEQ ID NO : 9 , 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or the 
polypeptide encoded by the cDNA insert of the plasmid 

30 deposited with ATCC as Accession Number 98807; and 

e) a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , SEQ ID NO : 9 , 
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SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO : 13 , or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the nucleic acid molecule hybridizes to a nucleic acid 
5 molecule comprising SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, SEQ ID NO: 10, or the complement thereof under 
stringent condi t ions . 

2 . The isolated nucleic acid molecule of claim 
1, which is selected from the group consisting of: 
L0 a) a nucleic acid comprising the nucleotide 

sequence of SEQ ID NO : 1 , SEQ ID NO : 3 , SEQ ID NO: 6, or SEQ 
ID NO: 10 or the cDNA insert of the plasmid deposited with 
ATCC as Accession Number 98807, or a complement thereof; 
and 

15 b) a nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 

20 plasmid deposited with ATCC as Accession Number 98807. 

3 . The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4 . The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 

25 polypeptide. 

5. A host cell containing the nucleic acid 
molecule of claim 1. 

6. The host cell of claim 5 which is a mammalian 
host cell . 
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7. A non-human mammalian host cell containing 
the nucleic acid molecule of claim 1. 

8. An isolated polypeptide selected from the 
group consisting of : 

5 a) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID N0:2, SEQ ID N0:4, SEQ ID 
NO: 5, SEQ ID NO : 7 , SEQ ID NO : 8 , SEQ ID NO : 9 , SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
10 NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO : 12 , or SEQ ID NO: 13. 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 

15 SEQ ID NO: 9, SEQ ID NO ill, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 

20 comprising SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 6 , SEQ ID 
NO: 10 or the complement thereof under stringent 
conditions; 

c) a polypeptide which is encoded by a nucleic 
acid molecule comprising a nucleotide sequence which is 

25 at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 
NO: 6, or SEQ ID NO: 10. 

9. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO : 2 , SEQ ID 
30 NO: 4, SEQ ID NO : 5 , SEQ ID NO: 7, SEQ ID NO : 8 , SEQ ID NO : 9 , 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807. 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences . 

11. An antibody which selectively binds to a 
polypeptide of claim 8. 

5 12 . A method for producing a polypeptide selected 

from the group consisting of: 

a) a polypeptide comprising the amino acid . 
sequence of SEQ ID NO : 2 , SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID 
NO : 7 , SEQ ID NO : 8 , SEQ ID NO : 9 , SEQ ID NO: 11, SEQ ID 

10 NO -.12, SEQ ID NO: 13, or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98807; 

b) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO : 4 , SEQ ID 

15 NO: 5, SEQ ID NO : 7 , SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 11, SEQ ID NO : 12 , SEQ ID NO: 13, or an amino acid 
se q U ence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98807, wherein 
the fragment comprises at least 15 contiguous amino acids 

20 of SEQ ID NO:2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , 

SEQ ID NO: 8, SEQ ID NO : 9 , SEQ ID NO: 11, SEQ ID NO: 12, SEQ 
ID NO: 13, or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number 98807; and 

25 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2, SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 8 , 
SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or 
an amino acid sequence encoded by the cDNA insert of the 

30 plasmid deposited with ATCC as Accession Number 98807, 
wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid sequence of 
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SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 6 , or SEQ ID NO: 10 
under stringent conditions; 

comprising culturing the host cell of claim 5 
under conditions in which the nucleic acid molecule is 
5 expressed. 

13 . A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 
10 b) determining whether the compound binds to the 

polypeptide in the sample. 

14. The method of claim 13, wherein the compound 
which binds to the polypeptide is an antibody. 

15 . A kit comprising a compound which selectively 
is binds to a polypeptide of claim 8 and instructions for 

use • 

16. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of : 

20 a) contacting the sample with a nucleic acid 

probe or primer which selectively hybridizes to the 
nucleic acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

25 17. The method of claim 16, wherein the sample 

comprises mRNA molecules and is contacted with a nucleic 
acid probe. 
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18. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 



19. A method fo ( r identifying a compound which 
5 binds to a polypeptide of claim 8 comprising the steps 

of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 8 with a test compound; 
and 

L0 b) determining whether the polypeptide binds to 

the test compound. 

20. The method of claim 19, wherein the binding 
of the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

15 a) detection of binding by direct detecting of 

test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; and 

c) detection of binding using an assay for 
20 Tango-77 -mediated signal transductions 

21. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 

25 sufficient concentration to modulate the activity of the 
polypeptide . 
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22. A method for identifying a compound which 
modulates the activity of a polypeptide of claim 8, 
comprising : 

a) contacting a polypeptide of claim 8 with a 
5 test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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>Con:igi 

GAAGTGAAGATATAATGTATAGTAGTAATATATAATGTTAGGTGAATfAA 

AGGAAATAGAATATATTGGGGAGTAATTATGGGTGTAAAGAAATATAGTA 

GGGAAGTATTTAGATTTGAGAAAAAAAAAAAGGAATTTAGTGTAGGTGAA 

JIAATAAAAGNANAAGGTTAAAAATTAAAAAAAAATTAAATATAAATAAAT 

AAATAAAAATAAAAATAAAATAAAAAATTTAAAAAATTAAAAAAATATAA 

AAAATAAAGAAATGGAAGTGGATTCTTAGAAAAAAAAGAAAGTAAGGTGA 

TATGAGGAGATAGAGAGGATGTGGTGTGAGATGATTGGTTTAATTAGAAA 

ATAGGTTTTGAATAGAGTGGGAAAGTAGAGTTTTGGTAAATGTGGGGGGA 

AGAGGGTAATG7TGTTTGAGTGAAAGAAAAAATGGTATATTTTTATAAAA 

TAATGAGGAAAGTGTGTGAAAAAAAAATTATTGGGATTTGGGAAGGTGAT 

ATATAAAGTTGTGGAAAATTTGGGGGGTGGGGTTTATTTAGGATTAAAAA 

GTTATTTAAAGAATGAAAATGAATTTTTGTTTGTAATTTGGGGATAAGAA 

ATTAATGTTTAGAAAGAAAGGGAAAAAATTGAAGAAAAAAATTTAGATTT 

TGGAAATTTAAAAATATTGTGGGTGTAAATAGGAAGGATTTTTAAAGGTA 

ATTGTGGAAGGGATTTGTGTGGAAAATAATAGGGAGAAAAAATGGGG 

>Connig2 

GCATCTAACTGGAGCCTGCATTATTACAGATTTAGCATCACCAAAGTCTA 

AACAATTAGACTGACTAAGGCAGAACTGCCCTTATGACAGCAGACATAAG 

AAGGAAAAGGCC^AAACACTGTGTTAAAAATTATCCAAATGTGAGGAA^ 

GGCAAAGAGAGTAGGTGTGCCTTTTTAGTGTCTAAGCTGCCTGCCCAAGG 

GGCATCTGATGCTCTCAGGCAGGAGTCCACAAATTTTTTTTTGTAAAAGA 

7CAGATAGTAAATCTTTTCAGCGTGAAGAGCATGAGGTCTCTGTCACAAA 

TACT C AAC C AC C ATT AC AACATGAAAG C AGC C AAC AG AC AAC ACATGAC A 

AATGAGTGTGGCTGTGTTCCAGTAAATCTTGATTACAAAAACAGGCAAGA 

GGCCAGAGCTGACCCATGGGCCATAGTTTGCTGACCCCTTCTGTAAAGGA 

AAGTATTTTTGTTTGACTTGCTGTTTACCATTGATTGAACACAAGGCTCT 

GTAAAGTTACTTGTTAACTTGCAGAAGATTGATGAGTGGCAAGTAATTTT 

TATTC AC CAGAATATAAAATTATTTCTGTTC AGTAGAAAAGAT AAAC CAA 

CTGTGATATTATGGTCCTG 

>Contig3 

GGGGTGTCTGTCTACCATGTGCTCGCAGTTCTGTAATAAATGTTCTCTCA 
AGATCCTTAAAATCTCTTGGAAATTATAAAAATATTGGAAAGAGAAGAAC 

GCTCTGTCGTCCAGGCTGGAGTGCAGTGGCGCAAACTTGGTTCACCACAA 

CCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCTGAGTAG 

CTGGGACTACAGGCGCCCGCCACCACGCCCAGCTAATTTTTGTATTTTTA 

GTAGAGACGAGGTTTTACTATGTTGGCTAGGCTGGTCTCAAACTCCTGAC 

CTTGTGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTG 

AGCCACTGCACCTGGCCAGTTTTTTAAATATATTTTTAAAAACACTTGAA 

TAAGAGTCAGTGTAAACTAGAAGTTTAAAAATGCTTCACAGAACACCCAG 

GGTTTACATTACAAGATTCTCACAACAAACCTATTGTAAAGGTGAGTAAG 

GCATGTTATTACAGAGAAAAGTTTGGGAGCAAAACTGTAAAAAATTATAT 

TTTTGTTGTATTTTCTAAGAGAAAGAGTATTGTTATGTTCTCCTAACCTC 

TGTTGATTACTACTTTAAGTGATTTCCTTGAGAGCACATGATGATCC 

>Contig4 

GCCGTTCATAGAAAACTGAAAGCAATAAGATGACTAGGTAAGCATGACAT 
TTAAAAGGTATTC ATGGGACGTGGTTACAAAAC CAACTCACAACTAAAAA 
GTCTTAGGACCTCTCGCTGACTTAGGAGCCTGATCCCAACTCTGAGAATG 
ACTCAGTGTGTTACCCTGTGGCTAGTGTAGACCAATGATCCTGTCTCAGA 
GTCACTAGCCAACAGCCCATATCAAGTACTTGAAACTTTGACTCAGAAAC 
CTCAGTGTCAGAACCTTTGACCTAGGAACCACCTGTAGTGGTTAACTGCA 
ATTTGCACCCCTTAGTTCAGGGCTTTACAACACCGGGGGCGGGGAGGGGA 
AAGGCATANANCTGATGACCTAAAGGAAACCCATTGCAGCAACGCTTTTG 
TGTTAAGTGTACAAATAAGTGTTGTTTTAGAATCCTCCAGGTAATGCCTT 
TGTTATTTAATGTGTCTGAGACAATTCTGCACATTAAAGAATATAAAATA 
TTACCTTGTAATTCCAATTTGAAATGTGTAATTGACATTAGACTTCTATT 
TGAATTTGAAATGTCTAAAACAATGTGGTTAAGTTTGTAAAAGGTGTGTG 
AATTTTGAGTCTGATTTACTACATTTTTTTTTAATTTTCTTTTTTTTTGG 
AGTTTTAGGGATTGCTTAGATGGCTAGAAAGATTTTATTCATCAGATTTT 
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t-aAGTCTGCCTTGGCAGGCACTTGCAG^ ^TTTGAAAGAATCAGATi«TATC 
AAATTTGTAGTTTAAAATATTTAAGGGAACTCAATTAACTATGCTAGAAA 
AGAGAATTAAGTATTTAGGAGGATTTAATATGGTGTGAAAGTTGTGAAAA 
^CAAAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAACCAGGG 
AAAGGCATGAAGATAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAG 

ATCTGC 
>Contig5 

GGGTTTTTCCGCGTTTTTACCCGAAATCTTCAAGGGATGGGAAAAAGAAA 

ATTGCTAAAAAATCTCGGTTTTTTGGTTTTAACAGATATTTACACCNTGG 

ATCCCATTTATTATGTTGTCCCCAAGGTTTTCGGTGGGTTCCCAATCAGT 

TAGCCCCCCTCCACAGTGAAAGCACTTTACTTTATCACCTTCACCTAAAG 

CATAAAATCCAGCTCTTGAAAGCTGCTCCTTGTTAACTGAATATATCCAC 

ATCCCAAAAGTAATGATCCATGCTTCATAATCTGCCACGGATGGATGGAT 

GGATGGATGGATGGATGGATGGATGGATGAATGGATGGATTGATTTCTTG 

GAGGATTTGTTGAATTTGGGAAATTCCACGCCAGGACAGCTGGCCCAAAC 

^GCCCGCGACAATCTGCTCGGTACAAGGGGAGGGTCCTGGAGAGGGTGCG 

GCCCGAGCCCCAGTTTGGAAATGCCAACTTGGCTCTGCAGCCGGGCCTTA 

GCCACTTGGGTCTGGCGTCCCTCCATTATTAGCGCCATGCCGGCTCGGGG 

TGCTGCCAAGTCCCTGAGAGCACAAGCC 

>Contig6 

CGCGCTCAAGAAAAGCTGAAGTGTGAATGTTCTGTCTACCTTCACAGTAA 
ATGCT AAG AGAATG AC C CAAG AG CAGAGGGT ATCACTCTG CT AC GGAGG A 
^TGATTGTAACTGGCTCTCCTGCCTTAGCAAGAAATGCCAGAACCATGGT 
CATTCAAGTTCTTGACCAAAAACTGCCTTCATGAGAATCAACTTCCCCAA 
GAAAAAAAAAGCAGAAACAGGCAAAGCTTCCAGCATGGTAGGTAATACTG 
ACCCTTCTTCCCTCCTTCCTTTGGAGATTCACACAGTAATAATGCATAAA 
GCTTTGCCAATGGACTAAGCACTGCCCAGGGGTTTTTGTCATGCCTGGAC 
TGAAATGCTCTTTTTGCGTTATCATAGAATCCCAGTGCAGTCTGAGTAGA 
CTCTAAGCAAAAGGGACATTTTTCAAAAAGGCTTTAAATTGCTAGTACAA 
AGAAGGCAACAAAACTTGCGTAACTGTGGACAGATTAACTCACTTGGTGT 
TTTGGCTCTTCAGTTTTCCCTTGGCTGCGAAGTACTCCTGAAGCTTTCTC 
TGCGGCTCTTCCTGCAAGCAGGCAAGCAAAAAAACGACTGAACTTTATTT 

CGAGAT 
>Contig7 

GAAGAGCCGCTAACTTGCTGTAGTGATAAGGAATGAACTAAGGCTAGGGA 

CATATTAACATCCGCTGGTGGTGACTCTTTAGCCTAGATCTTACCCCACT 

^CTGCTCCTTCCATATGGTTCGGTCTCAGGCTCACTACCGATCAATGGCG 

"ACTAAAAGCACTAACTATAGACTCCAACACGTCTGTCGTGTGTTTCACG 

ACAAGCCGTGGAGTTAATCCCTCTGACAGTAGCTCAGATAAGGATGGGCT 

ATCATGGGCCCGGAACTGGGGCATGACGCTCGTCACCAACGCATGAGCTC 

CCCAAGTATGCTATACCTGTCCCTATGAAGGGCTTCCAACTCTATGTGCA 

GTCCCCATGTGGAGAGTCAGGTATTGATTGATCAAGCCAGGGGTGTGGTG 

AATGGGGAGCTTCCTACAGGGGTAATGATAATTGAAATGCACGGTGATGG 

GGATTTTCATATTGGTCTCCTAAGGAGATAACAGATTGGATGCGGGGTCG 

ATATTCCACTGCCCAGGGTGTGTACCGAGGGTATCTGCAGGTGGATCTCC 

TCCCCACGTTTGATTAATACTCCTGTCTTGGGAAGCATAGACGGGCGGGG 

GAAATGATGAAGGGTGACCACTCCCC 

^Contiga 

GGGAACGCAGTGCTCTGTACGATGGCCTTGATTGCGAATTCCTGCAGGGG 
GGG 

>Contig9 _.. U n.-. 
GGCAAGAGATTTAATATTCATTCCATCTTCATTTGGAAGATGAAAAATTG 

GGGACCAGAGAGGGGAGGGGACTGGGCCAAGTTTTCAAAGAAAAGTCAGT 

AGGAATTGTGAATTCCTGGGGGCCGGGGCCCATTAGTGCTGTTTTGGATC 

AGTAAATGGAGATGTGAGTTTCAACAGTAACAGGGACATTTTAAAATTAA 

AATGATTTAACCTTTAGAAAATGTCCTATTTTGTAATAATGATGGATTCA 

CAGGAAGGTACAAAGAAATGTCCAGAGAGTTCNTGAGCCCCCTTCAGCCA 

GCTTCTTCCAATGTTAACATCTTGCATTATTATAGTACAACATCAAAACT 

GGGAAATCGATATTGGTACTGTCCAGATAGCTTACTCAGATTTTGCCAGT 

~ATAC — CCACTCATTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
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TGTGTGTAGCTCTATGCrtATTTTATGl ^ * GTAGCTTCATGTAAGC^CG** J 

AATCACAATACTTAACTATGCCCTCATCACAAGACTCTCTCTTGCTATGC 

TTTACAGCTGTATCCTCTTCATCTCCAAACCCTAAGCCCACCTCACCGCC 

TCCACCATCTCTAATCCCTGGCAACCACTATTCTGTGCTCCATCTCTGTA 

ATTAATTGTGTTAATTAATGTTATACAAATGGAATCATGAAGTATGTGTC 

CTTTGAGATTGGGCTGTTAATTTTTCACTCAGCACAATTTCCGTGAGTCT 

AATCCAACTTGTGTGTAGCAGTAATTCTTTCCTTATTATTGCTGAATAAT 

ATGCCATGGTATGGATGTATCACAGTGTGTCTAATCCTTTGCCCATTGAA 

AGGAATTTGGATAATTTCCAGGTTTTGGCTATTATGAATAAAGTGAACAT 

AAGACATGTGTGTACAAATTTTGGTGTGATCAAAAGTCTCATTTCTCTGG 

GATAAATGCCCGGTAATGAAATGGCTGGGTTGTGTGGG 

>ConciglO 

G CAAG AAC ACAGG CG CGTATTAT AAC CTTACTAC CAAGACCTGAAC C CAT 

ATAAAGGTTTATGCGTAACAATCATCATCCCTGTTCCAGAAGATTACACG 

TACGACCACGCCTGGCTCACCGACTCACGTGGGCCAGTACCAGAAATTCT 

CCCAAACAAACAGTCGTGTCTGAAAACAATCGCGGTQACCTQCACGGTTA 

GAAAAGCCTGTTTTCAAGTCCTGGAATTGCCACATATTAGCTGGGTAACT 

TTGGGCATCACATTTACTCTCTCCGAATTTCAGATTGCAAAAACTCATTG 

GATTGTTTTGTGGATTGAAAGAAATAATGTAAATTTAGGCCGAGTGCTTT 

GACTTACGCCTGTAATCCTATCACTTTGGGAGGCCAAAGCAGGAGGGTCA 

CTTG AG C T C AGGAATTTGAGAC CAC CTCTGGC AACAT AGTG AGAT C CTGT 

CTCTACAAAAAATTTTTTTTAAATTATCCAGCATGGTGGTACACGCCTGT 

ATTCCCAGC7ACTCAGGAGACTGAGGTGTGAGGATTGCTAGAACCTGGGA 

GATCAA.GTCAACAGTGAGCCGTGGTTGTGCCACTGCCCTCCAACCTCAGT 

GACAGAGGAAGACCCTGTCTCAAAAAAAAAAAAAAAAGTAGTAAGTTTAA 

AGAACTTAGTGTAGGCCTGGCATATAAATGATATTGTTGATGTTGATGTT 

AGCTTGAAGGCACATTTATAGGAGTAGGGATTTTATAACATTATGAGCCT 

GAGAGCACATATAATGTTCCC 

>Contigll 

GGTCTAACATGCTCCAACTGAAGAAACCCCACACTTGTCCGGCAAGGAAA 

CTACTACAGATTTCCTGACCTACTGTGCAATTCGGGGCATGCGACGGGAC 

TGTGTTTCTGGGTACGCTGTCTCAGGTTCGTCTGGGATGTAAGAATTCAA 

CTTCAGTAGTTCTCTCATAGACGCCGACGAGAGGGGCGTCTCTTTTCTCT 

GATGAATCTGCCAGATCTTCCACTTCATAGAGTCTAAATCCTCCGATTCG 

ATCTACTGGAGACCCCCACGTTACAAAAACGTCTAACGTCGGTGACAGCT 

C C C C AC AT AGGG AAAG ATCAC CTG AGTCTCACT AC CT CACATTAGTGCTA 

TCTCCAGCCCCATGCTATCTACGAGATGGTCACGCGAGGTTTAAGGGGTC 

TCCGATTCCGGTGGTCCGATTCAGCTAATCGTGGCCCTACGTGAACGATC 

ACTCCTGCTCGTAACATCGATACAGGGTCGCGCTGACAAATGGTACTACG 

TAGGTTCTCAGGTCAATGCCGCGTCACGAATGAGCCTAACTACCCCATAA 

GTGCACGTACTGTGTTACCTTTCCTGTTCGGCCAAACCTGCTACTGTATG 

CTGTGCTTGTTT 

>Contigl2 

AGGCTCCATGTGCTCTAGCCTGATTATCTTTTCAAGTGTTTTATTTGCTA 
ATCTATAAGGCCCCTTTCGTAAAATGTTCACTCATTTTCTAATTAGATAT 
TTTTTTTAATGTTGAGTTTTGAGAGTTCTTTAGATATTTTAGATACAAGT 
CCATTGTCAAATATGTGATTTACAAATATTTTCTCTCAATCTGTAATTTA 
GTTTTCATCCTCTTAACAGGGTCTTTTGGAGAGCAAATAATTTGATTTTC 
ATAAGGTTCAAATTATTAATTTTTTCTTGTATAGTTCACACTTCTAGTGT 
TAAGTCTAAAAACTGTGCCTTGTCATAGGTACCAAAGGTTTTCTCCAGTT 
TTTTTTCTAGAAGTTTAGAGTTTCATGTTTTACATTGGAGTCCATGATCC 
ATTGTTAATTAATTTTTGTATATAGGTAGATGTTTAGGTTTAGGGTTTTT 
TTAAAAAAAAATTACATATGTTTAATTGCTCCAGTTCCCTTTCATTGAAA 
AGGGTATCCTTCCTCCATTGAATTGCCTTTGTCAGAAATTAATTGGACAT 
ATTTGTGTGAGTCTATTTCTGGGCTCTTTATCATGTTACTTTTAAAAAAT 
GCATCAGTTCCTCCACCAATACCTCATTGTCTTGATTATTGCAGTTATAT 
AGTAAGCCTTAGCATTAGGAAAAGTGTTTTTCCTGCTTTATTCTTTNTCA 
AAAAATTTTTGGATATTCTAGGGCCTTTACATATAAATTTTAAAATAACT 
TTGTCTATGTCTAACCGAAAGCCTTATGAAGATTTTGATAAGAATTGCAT 
TATGCCTATACATTAATTTAAAAAGAACTGATGTCTTTATTCAGTTGATT 
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CTGCTAATCTATGAACAl^GCATCTCTv. . CAAAGCATTTAGTCTT TCTT . 
AATTTCTGTCATTAATTTTTTAAAATTTTCATCCTAAAGATTCTGTATAT 
GTTTTGTTGAATTTATGCTTAAGCATTTCACTTTCTTGGTAACAATTATA 
AATGATTT7GTGTTTTTTATTCCACTAGTTCATTTTCAGTGTGTAGAAAA 
GCAATGAATTTTTGTGTGTTGATCTTTGTTCCTACATCTTGCAACATTAT 
TGAACTCATTTATTAGTTCTAGGAGGTTTTTTCATTTTTCTTGTAGATAC 

cttgagattttctatatagacagtcatgttgtctgcaaacaggcacagtt 
ttXtttcttccttttcaatctatatgcctttttttttttttttgccttat 
tgcagtggctagaacttctagcactatgtcaaatagcattggtgaaagca 
gacatccttgttccttgtcttagaggaacatttggtctttaatcttggat 

TGCG 

>Concigl3 

GCGCCTCCTTTTCTCTTCCAAAATTTCTCTTGTCTAGTTATTTGTCCAGG 

GAAATTTGAAAGCTCACTTACTGTGCAAGTCAGCAGGAAACAACTGGGTC 

TGTGCACAGCACCTAGCAAAGTTCTGCTCTAGGAATTACACTTTGGCCCT 

GAGGTAGATTTCTACAAGAACCTTACCTTCTAAGCAGCACTGGGGTTCAT 

CTTTTTCCCAGTCCTCAGAGCCCATTTTCACTCCTGAGTTCTCCCCCACA 

AAGGACATTTTCAACGTTGAGTTTATTACTCAACAGAAAATGGAATGAAG 

TCCAAGACCTAAGGAGATAGAAAGGGGACCAGTTATGGCATCTTCTCACC 

CCAGGACACCTTGCTGCATGTCTCTAGTGCTGAACAGACCACTGGCCTTG 

CTCTGTAGTTTGAAATGCTCGCTGCAACCAGAAAGGCACCAAGGGGCCAG 

ACCATGCTCTCCTGTCTATCACGCCTTCAAAGCAGAATTTCCCAAACCTT 

GAGTCACAGTGCTAACACACGGGGTGCCATAACATTTTTGTTGATTTTGG 

CATTTTACAAAAATAAAATAAAAAAGTTAAAAATGCATTGCTCTATTCTT 

GGGGCTGGCACACTATTGCCTTTGGCCAAATCCGGTCCCTGACTGTTTTT 

TTAAATAAAGT7TTATTGAAACACAACCATGCTCTTGTGTACATATTGTC 

TCTTGGCTGCTTCGAAGCTACAATA 

>Concigl4 

GTGTTCGCTTTTTAACACTT AC CTAAAATTACTCTGTAAT C CATGGATC C 

TTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCCCA 

TCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCCAA 

CTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAACT 

TTCTAAAATAGCATTATTTTATAGCATGAGTGTTCATTTACATTTTTGCA 

TATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACTCC 

CCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGCAC 

TATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAGTTTAAACTA 

AGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCCAC 

ATTTCAAGTGCTCAGCAGCCACGTGCGACTAATGACTACTGTACATCAAA 

CATATAGAACATTTCCATCATGGCAAAGAGCTCTATTGATAGTGTTCATC 

CAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTCAT 

GGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTATTT 

CTGCNACCATTTACCGGGAGAAGGCCTGGAAATCATCACCAGGCCAACTC 

AAAATTATTACGTTTTCCAGAGCTTATATACCTTCTAAGCTATATGTCTA 

CGTGTAAGTGTGCATTCACCTGAAGACGTTAGTGATTAACTTCTTTTAAT 

CTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTAAA 

TTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCTTG 

TCCCCTGCTAAATCATGGAGGTTTGGGGATTCCTTTAGAGCACCAATAAA 

CTTGTTTGTGGAGGCCTGGGGGTTTCTTCTGACCCACAATAAAACTTGTT 

TAATCCTAAATGGGTCCTGTTAAGAATTCCTTCTTTATTTTGTCATATTT 

TAAGGCCCAGAAAAGGCCTGGGCAAAACTCTTGATGGGCTTTTGTTACAT 

TCCAGCCTTTGTATAAGAACACTGGTTTTTAATATTTAACTTAACCATTT 

AGTCAGTACTGAAACAGTTGTTATAGAGATCTGCATTAGTGAGACCTGGC 

CTGCCACATTTCCTTTTCTGAAGATCTTATGGTAGTGATCACCTTTGTGA 

AAGGAAAATAAATCTTGGGACCTCAAAATCACTAAGCCAAAGAAAAAAGT 

CAAGCTGGGAAGAATCTGACACTTAAATCCAACACTGCTAACTCATTCAT 

CTCACTCATTCATTCATTTTATTTTCTTTTTTCTTTCTTTTTTTTTTTTT 

TTTTTTGAAACGAAGTCTTGCTCTGTCACCCAAGCTGGAGTGCAGTGGAT 

CTCAGGTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTACCT 

CAGACTCCTGAGTAGCTGGAATTACAGGCACCTGCCACCACGCCTGGCTA 

ATTTTTATATTTTTAGTAGAGACGGGGTTTCACCATGTTCATCAGGCTGG 
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-r-TCGAACTCCTGACCTCGTGATCCGC . .CCCCCTCGGCCTTGTTTGCT x 
GAGGTACTGTCTAAATGCTGGAACTGAAAATGGCAAGCAAGACATCV.CTA 
^CTTGAGGAAACTGTAATCTAGTCGGAAATACAGATGTCAACCAAGTC. 

CACACAAGAANATTGTACAAAACCCCTAGGA 

>concigl5 



$£S£aa?a^^ 

CTGTGCTTTTATAAGCACTTAGCTAAGATTATCTCACATAATTATCATAA 
rrACTGTTACTATGACCACTTTACAAACAAAACTGAGGCACAAAGAAGTT 
r^AAAACTAATCCAAACAAACTGGCTCCAAAAGGAACTTTGCTTTCTTTG 
GGTATCAA^TTCTGAAGAGTACACATTTAACATTGAAACTGA^GTCAGAA 
GGCAAGTTTCTATGTAAAGTTGGAGTATTCTGAATACTCTGGGTAGCTAC 
^^nTRTTTAAATTTTATCTTGGATTCTGCAGATAAGGATAAAATAGA 



TCGAGAAATGAAGTATTATATTTTTGTGAGTTGCAAXTu i 1 1 i£ ± i I^™J. 
TTTTTGTGTACTTTGGTTGTCTTTTCTTGTGTTTTCCTAGTACCAATGTT 

ATGCTGACTTAGGCAGATGAGTTGAGTATTTTCCTTTTTGCCCTATAAAC 

TGAAAATAGTTTGTATGACATGAGAATTATTTTTATTTTTTGAAGGTTTG 

ATAAAAACTTGCCCATAAAAATCGTCTGGACCGGTTTCTTGAGGATGCCx 

GTGTTAGAGCC 



CGCTTTAACCTGGGCTACCAATGGTTCGTCAAGTTCTAGATTCTCTATTA 
r_.iii^^^^ w ^^nT^^rTrtr,TrTGTTTTCAGCCCCGAGTCTCT 



TAGATCTGTCC7CTAATATTCCTATTGACTTTAI.TI lahu ^ mw«, * w * 
ATCC TTGCTTTACTTTCCGAGAGACCTGCTTAACCTTATCTCCCAA 
C^CTTTTATTGAATTTCATTTCTTTTACTATATATTTTTTACTTTGAATA 
-ACC^^CTCTTCCTCACATTTTCCCCCATAGTATTTTGTCTTCAATTGA 
1V__1~ — ~,~r™ rrrrnm rz a.TRTT AATAATAGTTTTTAAATT 



TTTATTTATTTTTATTTTCAAAACAGTGTU i i/w.i<-ivji w*v- * -------- 

GAGTGCAGTGGTGTGATCATGGATCACTGCAGCCTTGATCTCTGAGCTCA 

AGCTATCCTCCTGCTTCAGCCTCCCAAGTAGCTGGAACCACAGGCATGTG 

TCACCATACCCAGCTAATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGT 



i^^^i^^rrarTaATTTTTAAATTGTTAATAAAGACGAGGTCTT 



GCTATGTTGCCCAGTATGGTCTTGAACTCCTGGGCTTAAGTAATCCi^ 

gcSSSEtcccaaagtgttgggattacaggtgt^ 

ACATTTTTTAAAAGTTTTCTTCTCTTTACCAAGTCTTTTTTCCCCTTTCT 
GCTTT^TGGGTTGTTTTATTTTGATCTCTATCTTGCTAGAAACTTTCTG 

cagacgS^taatactagatttttgagagtgggcaactggaaagctga 

. r i±_-iCV™,« ^ » tv^^t." * rv.^TTnTTr.arTGTGAGTGTCATTG 
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zvTTCAACATCATGCTGAAiCCTTC 

TCCTACCTTAACTCTCTGTCTTCTGCTAGTATGGGAAAGATGACCTGAAA 

2VTCTAACCATTTATTTTTCCCCCATTAATATCATTTTATGATTATTCAGA 

AGTTAAATAATTGTCATGCTGTCCTCCAAAAAGACTGAATCAACTAGCAA 

GAAATAAGAATTTTCTCACAGCTCTGCCAGCATTTTAAAAGAATAGCTTT 

ATTGAGCCCAGGAGGTCAAGGCTGCAGTGAGCTGTGATTACACCACTCTA 

CCCCAGCCTGGGTGACAGAGCAAAACCCTGTCTCAAAAAAGAAATTTAAG 

GAACAGCTTTATTGTTGTAAAATAGACATACAATAAACAGAGCACATATT 

TAAATTGTGCAACTTATACTTTGATATAACCCTGTGAAAACATCACCACA 

ATCAAGATAGTGAATATATTTATCACCTCCTGATACAGTTTAGCTCTGTG 

TCCCCACCTAAGTCTCATGTTGAATTGTAATCCCCAATGCTGGGGGAGGG 

GCTTTGTGGGAGGTGATTGAATTGTGGGGGTGCACTTCCCCCTTGCTGTT 

CTTGAGATAGTGAATGAGCTCTCATGAGCTCCCCTTCACTCACTCTCTTT 

CCTGCTGCCATGTGAGGATGTGCTTGCCTCTTCTTTGCCCTTCTGCCATG 

ATGTGTTTCCTGAGTCCTCCCTAACCATGCCTCCTGTACAGCTTGCAGAA 

CTGTGAGTCAGTTAAATCTCTTTTCTTCATAAATTACCCAGXCTCAGG7G 

GCTCTTTATAGCAGTGTGAAAAGGAACTAATATACCTCCTAAGTTACCTC 

AAGCTTGTTTTTAATTCCTTCTCCTCCCTTCCTTCATTGCCAAGCAAACA 

ACCACCTGTTTTCTGTCACTATAGATTAGTTTACATTTTGTGGGTTTTTT 

TTTTTTTTGAGACAAGGTCTGACTCTGTTGCACAGGAGCAGAGCAGCGTA 

TC 

>Concigi7 

CGCGTTATAGGAGATGCGAACTTAAGAAATGATGATAAGGAGACTTTA7T 

AAATATAATTTTGAATTATTTTGCCATTACAGAAATTCTAATTATTTAAA 

ATTCTATTCATAATTTTTAATCACTGTACTTCCCAAGCTTAGCTTAGAAT 

CCTTCTGTGCTGAGGATTAATTTTAATTTGTCTTTTATAGGCCTTATCTA 

AAAT C C AAGAATAATTGC CAGAAT C AAC CAC CTTCTAAAT CTGTAAGTAG 

AAATTAGTCTTTTTAAAAATATGCATTCATAAGTATGATTAGTAATAAAA 

ATAATAAAGATGTTAGCAACCTAAAGAACATGTATTTGAAAGGTATTTCT 

TACAGATATAAAAACAGTTTGGTTTAATAAGAGACAATCATTTTTTGAAA 

AGTATGACATTTTTTGAAAAGTAGTTTAGTTTTATTAACCAAGAAAAGCC 

TCAAGTGAACTTTAGTCCTCTTGATAGCTAACATTTATTGAATGCTTACT 

GTGTGCCTGATACTTTTCTGACTTGCATTACCTCACTGAGTCCTCACAAT 

CTTATGAGGCTACTATTAGTAGCCCCACTTTACAGATGAGCAAACTAAGT 

CACAGAAAGGTTAAATAGGTCGTATAGGTATTAAGTGACAAAGCTGAGAG 

CCTGTGATCTTAACCACTTTGGTATGCTGCCATGAAGTTAAATAGCTCAG 

TAGTCATTAAAAGAGAACATTTGCATTGAACCTTCCAAGCCACTTAACAA 

GTATATGCTTCCTAATCAATTTAATTTAGCTACATTAGATAGAATGGTAA 

\GGATCCTTAACTTAAAGTTTAAATGGAAGAAATTAGCCCTCTGAAAGAG 

GCACAGATTATTCATCTGCAATAAAAATCTCACCTTTAGTTTTTTAAAAC 

ATAGTTTTTATCTGTGTTCTGAAATGTAACTAAAACAGTGCTTCCTGAAG 

TGAAAAATTCTCACTGGTGAGAATTTTAATAAGTTTTAATGATTCACCAA 

ATCACTTCAGTCATATTTCAGTCATATGCATATGCATATATAGACATATA 

AGTTTTTATCTGTGTTCTGAAATGTAACTAAAATAGTGCTTCCTGAAGTG 

AAAAATTCTCACTGGTGAGAATTTTAATAAGTTTTAATGATTCACCAAAT 

CACTTCAGTCATATTTCAGTCATATGCATATGCATATGTAGACATATATA 

TGTTGTATGTATACATGACATCATTAGACACTGTGAAGGATAGCAAAATG 

TATATAAGGCAAAATTTATGAACAATGGTTTAACGTTTGGGAAGCACTGG 

GTTACACTTTTACTTTATGCAGATTGAACCAGTATAGTATGCAAGTCTTA 

AGGAAAAATCTACTGGAAAGGGCCCTCATTCAGACTTCCCAGAGGCTTCT 

CTGGAAG7TGACAATACTGACTTCAGTACATCAGCTCGTAAATGAGGATG 

ATACCTACCTTATCTGCTTTACACAGTTGTAAAAGTAAAAAGTGAACTCA 

GGAAGGGAATTACAGAATTTAGGAGAAACTAAAAGCACGATGTAAATAAT 

AGTCATCATTACAGTTATATAATGCTTGACAATTTATATAACACTTTCGA 

TACATGACAACAATAACTAACACCCAGACATGTTTATATACATTACCTCA 

CTCAGAACAACCATGTGAGGAAGTTGGCCATATGCTTTAATGTCCAAACC 

AGGACACTTTTGAGAGTAAAAAGCAGTACTCTTTGACCAACAGGCATAAA 

TCAAAACTATCTTGTGAAAACCGGGATATATGGCATCCTTCCTAGATAAT 

AGATACTTTTACTATTATTAATTTTGCTGTGAATCTAAACCTGCTCTAAA 

AAAGTTAATTTTAAAAAGTAATGAAGTACTGATACATGCTACAACATGGG 
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TAAATCT7GAAAACGTT**rGCTAAGTG. _ JVGAAGCCAGACAGAAAMSSc w 

ACATATTACATGATTCCATTTATATGACACATCTAAAATAGGCACATCTA 

TAGACATACAGAGACAGAAAGTAGACTAGCGGTTGCCAAGAACTGCAGGG 

AGCAGAAGATGGGGAGTGACTGCCAATANGAAAACGCATTACGT 

>Concigl8 

TGAATCGCAATGATATGTGCCACTTTGCACTCTCTGTGACATATATAATT 

ATTTTTAATGCATTCATTTTTTTCTCAGAGTGCATTCGTTTGAAAACATA 

GAlZGGGAAATACTGGTAGTCTTCCTTGTCAGTTAGAAACACCCAAACAAT 

GAAAAATGAAAAAGTTGCACAAATAGTCTCTAAAAACAATGAAACTATTG 

CCTGAGGAATTGAAGTTTAAAAAGAAGCACATAAGCAACAACAAGGATAA 

TCCTAGAAAACCAGTTCTGCTGACTGGGTGATTTCACTTCTCTTTGCTTC 

CTCATCTGGATTGGCATATTCCTAATATCCCCTCCAGAACTATTTTCCCT 

GTTTGTACTAAACTGTGTATATCATCTGTGTTTGTACATAGACATTAATC 

TGCACTTGTGATCATGGTTTTAGAAATCATCAAGCCTAGGTCAGCACCTT 

TTAGCTTCCTGAGCAATGTGAAATACAACTTTATGAGGATCATCAAATAC 

GAATTCATCCTGAATGACGCCCTCAATCAAAGTATAATTCG^GCCAATGA 

TCAGTACCTCACGGCTGCTGCATTACATAATCTGGATGAAGCAGGTACAT 

TAAAATGGCACCAGACATTTCTGTCATCCTCCCCTCCTTTCATTTACTTA 

TTTATTTATTTCAATCTTTCTGCTTGCAAAAAACATACCTCTTCAGAGTT 

CTGGGTTGCACAATTCTTCCAGAATAGCTTGAAACACAGCACCCCCATAA 

AAATCCCAAGCCAGGGCAGAAGGTTCAACTAAATCTGGAAGTTCCACAAG 

AGAGAAGTTTCCTATCTTTGAGAGTAAAGGGTTGTGCACAAAGCTAGCTG 

ATGTACTACCTCTTTGGTTCTTTCAGACATTCTTACCCTCAATTTTAAAA 

CTGAGGAAACTGTCAGACATATTAAATGATTTACTCAGATTTACCCAGAA 

GCCAATGAAGAACAATCACTCTCCTTTAAAAAGTCTGTTGATCAAACTCA 

CAAGTAACAC C AAAC CAGG AAGATCTTT ATTATCTCTG ATAAC ATATTTG 

TGAGG CAAAAC CTC CAATAAGCTACAAATATGGCTTAAAGGATG AAGTTT 

AGTGT C CAAAAACTTTTATCACAC ACAT C CAATTTTCATGG CGGACATGT 

TTTAGTTTCAACAGTATACATATTTTCAAAGGTCCAGAGAGGCAATTTTG 

CAATAAACAAGCAAGACTTTTTCTGATTGGATGCACTTCAGCTAACATGC 

TTTCAACTCTACATTTACAAATTATTTTGTGTTCTATTTTTCTACTTAAT 

ATTATTTCTGCAATTTTCCCAATATTGACATCGTGTATGTATTTGCCATT 

TTTAATATCACTAGACAATTCAATCAGGTTGCTACGTTGGTCCCTTGGGT 

TTACTCTAAATAGCTTGATTGCAAATATCTTTGTATATATTATTGTTTTT 

TCTCCTATCTTGTAATTTCTTTGAGCACATCCCAAAGAGGAATGCCTAGA 

TCAATGGGCACAAATAATTTGACAGCTCTTATTAAACATTATTCTGTAAG 

TAAAAACTGAACTACTTTTCAGTATCACTAGCAACATATGAGTGTATCAG 

CTTCCTAAACCCCTCCATGTTAGGTCATTATGAACTTATGATCTAACAAA 

TTACAGGGTCTTATCCCACTAATGAAATTATAAGAGATTCAACACTTATT 

CAGCCCCGAAGGATTCATTCAACGTAGAAAATTCTAAGAACATTAACCAA 

GTATTTACCTGCCTAGTGAGTGTGGAAGACATTGTGAAGGACACAAAGAT 

GTATAGAATTCCATTCCTGACTTCCAGGTATTTACACCATAGGTGGGGAC 

CTAACTACACAC^CACAC^CACACACACACACACACACACACACACACAC 

CATGCACACACAATCTACATCAACACTTGATTTTATACAAATACAATGAA 

TTTACTTTCTTTTTGGTTCTTCTCTTCACCAGTGAAATTTGACATGGGTG 

CTTATAAGTCATCAAAGGATGATGCTAAAATTACCGTGATTCTAAGAATC 

TCAAAAACTCAATTGTTTGTGACTGCGCAAGAAGAAAACCACCCATGCTG 

CTGAAAGTCAGTTGTCCTTTGTCTCCAACTTTACTTCCTTTACCTCTCAT 

ATG7TTGTGAATAAGCCCAATAAGCAGACNCCTCCTACAAAGTGAACCTG 

GTCTCTTTCCTCCTAACAGGG 

>Concigl9 

GTCTTGTAACACAGGTAAGACGAGTTCAAGTTTTATTTCTTGNTTTTAGA 
ACGGTAGTGAGCGGTTTTCAGCNTGAGACCACACCTAAGGTAAGTAGCTG 
AATTGGGGTTTTGTCTTGGCTAAAGTTTAACAACCAGCTGGTCTTAATTT 
CTCCTTACCATTAGAGCACTCAGTAATCATATAAGTTGTGTGATCATTCA 
TTTTGCTTAACTGTTTGTTTCTGTTTTTATTGCTGTTTCAGTCTTTTTCC 
CATTGGGTTTGACCTACTCTATCTGACTTGATCAAATCCAAAGGAAATTT 
CCAAATTATGGGGAATGAGGCCTCTGAAGTGGCTAAATTCCCACCCTCCC 
ACACACACAAACGTGGTATGGTGGGGGAAAAAACGGCCAGCAAAAGAAAA 
AAAAAAAGGAAAAGATGTTTCATTTTGACCACCAAACGGGCTTTATTTAC 
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ATAACAAGGC CAC CTTT * *'GCTAGCCA ^CCATACTGAAAGAGGSSTGL,- 

TGTTGCCCCATGCTGTGGGTTCCATAGCTAACG7TCTGCCTTTTTTCCTA 

CCACGACAGCCTGGGTTTGGTTCCTAAATCAAGCCTTTTCTGGTTTGATA 

CTTGGTAATGCTGAAATAGCAGCAATTTG7CCTAGCTGAAATATCGTAAT 

AAGATTTTAAAAGATTTATTTTAAAGGACCTCAATAGTTAAAAGTCAGCT 

TAATTAAAAGCTAACATCCAAGATGTGTGCATGTGTATGTATGCGTCTTT 

GTATTTAAATAGCCCTCATGTTTTTTTTTTCTTTCCTAGGAACTTGCCTT 

TTTTTGAGCAAAAGTTTTTTTCTTCTCTGTTGACTGGATTCTGTTTTCTT 

CATTTACTTCTGCTGTCTCTCCTTTCTCTTGCACCGTCTGCTGCATGAGA 

GCCCTAAAATAGTTTATAATAGCCTGGGGTTCCTTAAAGAAAATGGAGAA 

GGTGCCAGGCTCCCTTTTAGGGAGAAACTTCTATTTTTCCTTATGGAATC 

CCTAGAGTGTAAACAGACAAGTTCATTTCAGCTCTTAAACTGCTTGCGTT 

TGTGTTGTGTTACCTGATTTTTTTGACTATTATATTTTTGACTAGCTATT 

GCAACAGAAGCTACTCTTGGGTTTTCAAGGAAGATTGTAGTTTAGACATG 

TAGAAATGTCTTTTAAAAAAAAAACAAACTTTTTTTTAAGTGCACTGTAA 

AAGC AT C AT ATGGTCTAGC CTCCTAATAATTTTC C CTTTTTGG AG AC CAG 

GATTCAGGGTGGGCTCTGCCCAGAGCTCAGAGATCCAGTTAAAAGAGAGG 

TAGTCTCGGCCGGGCGTAGAGGCCCAGCCTGTAATCCCAGCACTTTGGGA 

GGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCCA 

ACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTG 

GTGGCAGGTGCCTGTAGTCCCAGCCACTCGGGAGACTGAGGAAAGAGGAG 

AATCGTTTGAACCCGGGAGGCGGAGCTTGCAGTGAGACGAGATGGCGCCA 

CTGCACTCCAGCCTGGCGACAGTGAGACTCCGTCTCAAAAAAAAAAAGAT 

AGGTAGACTCGATGTTGTCGTACCCGAGCAAGTTAGAGCAACGCCACACT 

TTGAGACGAATTTAAGAGTCCTTTATCAGCCGGCGACCAAGAGACGGCTA 

ACGCTCGAAATTCTCTCGGCCCCTTGGAAGGGGCTTGATTTTCCTTTATG 

CTTTGGTTTAGGAAGGGGAGGGGAGCTCAGTTGCAACAATTCTACAGGAG 

TAAAAACATGCAAAGAAATTAAAAAGACAAGTGGTTACAGGGAAACAAAC 

AGTTCCAGGTGCAGGGGCTCTAAATCTATCATAAGATGTTAGGTATGGGG 

GCTCTGCCGGACACAAACTCAAGGCTTTATGCTGTTATCTCTTGAGCGAA 

ATCCTGGGAACTTCGTACATTGCTTGCTTCAGTACCTTATCAGTTAATCG 

GACTCTTTGATATGTTGGGAGTCAGCGTACACAAGTTAACTCCTTGAGGA 

AGGGGGTGGGTAAGGAGTCCTTGATGTCTGGTAAATGAAGGAGCGAAATC 

GAGTTCGTCTGGCTTTCTCAGCTAAGGGAGAGCTTATTCATGTGGAAACA 

AGGCTAAGTGATTAAGGGAGAAAGGGAGAGTCTGAAAACAAGGTTAGGTA 

TTACAATGTCAATAAAATTGGTCTCCTTATACAGTCCTATGGTAGATTTC 

TTTCCATCTTTAATCTCCCTCTAGCACCACCAGACTTTTTCTCTCTGTAC 

CTTGAGATGTAAATTTTGCTATCTGAATTTTCGTCTAAGAGTTGTTTCCT 

TTAATATGCAAATTTAGGGTTATTTAGCTGACAACTGCCAAAGTAGTGAA 

ACAAGTTATCAAGAACTTGAACGTCTAAGGTAGGAAAAAAAAAAGTCTTT 

ATGAATCTATAAGATGTACTTCTATTGGCATGCCTAATACGTCTATGTAT 

TTACGTGTTGTGTACACAGTTTTTCACTACTGAAAATATATAGAGGAGTT 

CTAATTAATTGACTTAAGACAATAAAAGCGCTTGAATCAAATACCTTATC 

AGGAAAAAGGAAAAGACAAGTCAAATGCTTGTTCAAGTCTATATAACTTA 

AGTAAAATCTTTAATAAATAAGCTAGCTTTAACATTATTTGAAATGTCTT 

AAGAATTGCCAGCAGGTTCTGGGTTACAGAACTAGTGGGGGTGCAGTGGG 

GTGAGGGTTGGTGGGGTGGGNGGTNNNACNNNNNCNCCCCCCCCCCCCCC 

CCCCCCCCCCCCCTCCCCCCCCGCCCCGNGCGGGCCGCGCCCCCCCCCGC 

CCCCCCGGCCCGCCCCCCGCGGCCCCCCACCCCCCCCCCCCCCCCCCCGC 

GCCCCGCCCCCCCCCCCGCGCCCCCCACCCCCCCGCCCCCCCGCCCCCCC 

CCCCCCCCCCCACCCCCCACACCCGGCCCACACGCACCCCCCACCCCGAC 

GCCCCCGCCCCCCCCCCCCCGCAGCCGACGCCCCCCCCCCGCCCGCCCCG 

CCCCGCACCCCCGACCCCCCCCGCCGCCCCGCCCCCGCCCCCCCCCCCCG 

GCCCCCCCCCCGCCGGCGCGGCGCCCCACCCCCCCCCCCCAGCCCCGACC 

GCGCGCCCCCCCCACCCCCCCCCCAGCCCCCGCCCCCCGCCCCGACCC 

>Concig20 

GGCAGTACGCTATAATTCCCTCTTCACCTTACCTCATCTGTTCTCTGATG 
GATGTACTTTTTTTTTTAGTTTCTAAATTCCCTTTTCCTTTGCTCTGGAG 
ATGGGTGATTGATGTAGTCTGGGTATTTGTTCCCTCCAAATCTCATGTTG 
AAATGTAATCCCCAGTGTTGGAGGTAGGGCCTGGTGGGAGGTGTTTGGAT 
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CATGGGGG CAGATCCC . ^ATGAATAGL .'GGTACTGTCCTCTCAATAG _ J: 

AATGAGTTCTCCTGAGATATGGTTGTTTAAAAGTGTGTGGCACTCCCCCA' 

TTGCTCTCTTGTTACTGCTTTCGACATGTGACATCCCTGCTCCCCTTCGC 

TCTCTGCCATGATTGAAAGTTTCCTAAGGCTTCGCCAAAAGCTGAGCAGA 

TGTGGGTGCCATGCTTGTACAGCCTGCAGAACTGTGAGCCAAAATAAACT 

TCATTTCCATATAAATTACCCAGCCTCAGATATTTCTTTATAGCAACATA 

AGAGTGGCTTAATACAGGCTGGGCATGGTGGCTCACGCCTGTAATCCCAG 

CACTGTGGGAGGCTGAGGGGGGTGGAACATGAGGTCAGGAGATTGAGACC 

ACCGGCTAACACGGTGAAACTCCATCTCTACTAAAAATACAAAAAATTAG 

TCGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCTGGAGGCTGAGG 

CAGGAGAATGGCATGAACCCGGGAAGCGGAGCTTGCAGTGAGCCGAGATT 

GCACCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCATTTAAAAA 

GAAAAAACAAAATTTCAAACAGAACAAAATGAAAAAAATACCAAGTGAAA 

GGCCCCTATAAAAACCCCTCTGGGGCCCATCCTCCCACCCCCTCAAGTGA 

AACCACATTTAACAATTTGGTGCATATCTTTCCAAACCTTTTGTTGTACA 

CATATAAAAAACATACATGCTTTGATTTGGCTCAGACTGTACATAGTGTT 

TTCCCTCTTGCATTTTACACTTAATATATCTTTGACATCTTTCTATGTCA 

GTGCATGTTGGCTCGATGATATTCTATCATTAAATACCCTTCCAAAAATG 

GTAAAATCATTTTAAAAAATCATTCACACAAGTACATATTTACAATTTTA 

AAAG AAAA CAG AAT C C CAAAACA C AAC G AC AAAC C T C T AAAAAT AAT CT C 

TATCTTTCCACCAGCATGGAACAGTTCATTCCTTTTTCACATAAAACGAA 

TTATGTGATTGGAAAGATTAACTCTAATCTACACATTTATATACAGAATG 

TTCTATTTGTTAAGCCTATCTGAAAATAAAAAATTCAGATGATTAATTCA 

CTTACACTTAGAAATTAAGTCAATATACTATGAATACACATTGTGATCAG 

TTATAATATGATGCTTCTTAGTCTAGGGTTTCAATTAAATAACAGTAAAA 

AAAATTGGATAAATAAGACAGCTAATAACTGAAAAATCCAGAAATTCAAA 

GATTATATTGCCAACTAAAACACTGCCATTTACATTTTTTTTTCCTACT7 

GGTAGCAAATGCTAATGGAATTCAATCCTGATTACTTAAAGTCAGTTCAC 

ATCACACATTCAATCAGGATAATACGAACATAATATGCCTACTATAGCGT 

TAGATTAAGACATAAAATTTTTTTGCTTGAAAGTAATGACTGCGTACCAC 

TTGAGACATTTGTCAACCACTTCAGCACATTGTTTACGAGTGACTGGATG 

TCCACAAGGAATAAAAACGACAGCAATATTTCTATCCATACAGATTTTGC 

AAAGCTTCTCCTCTTGCAGGTGTCTTAGCTGCTCTTCAGTACTAATCTCT 

TTCTGCAATGAAGTCTGACTTGATTCGTCTTGTGTACTGTCTTTCTGAGC 

CTTCACTGGATCTGCAATCAGAACCTCAAGTGATTTACAGTTGCTCCCAG 

ATGTCTGAATTTTTTCCTCCATTATTTTCTTAATGTCTTTGAAACTGAAC 

C C CATTCAT ATAGCTTCTTGTACCATAGGATTATGGAAGATGGTATCAAT 

TTTTCTAGTTAGTGATGGCGTTTTTTCAGCAGTTCTTACCAGACACTCCT 

CAAGTGAATGGGATAAATGAATATTGTTTATATATTTTCGTGTCTTCTGT 

TCTAACAGATATTTACACCCTGGATGCCATTAACATGTTGTCCCAAGGGT 

CTTNCTGGGCT 

>Concig21 

CTTTCTCCCTTTTTACCCCCATTTTCGTAGGGATTTGGTTAAAACCCATG 
TAAAAAATCCAAACACCGGCGGGGAACGGGGGTTCAAGCTCGTATCCCCA 
CCACTTTGGGAACCCAAGGTGGCAGGATTGTCGGAAGCCAGGCATTTGAG 
CCCACCCTTGGGAAAAAAAAGAGAACCCCCATTTTTTTTGAACAAAAACC 
CCAACCCTCCCAGGAAAGAAATAAGTATGGCTGGGTTGAAGTCACCAAAG 
ATGGCCGACTGGCTGGTCAAGTAACTTTACCTGATGGTTCGTAGAATATT 
TACCTTCACCCAGGTGGGAGAATTGCTTGAGCCAACCCTCAGTGTGGATT 
CAGGAACTTGATTTAATTGGTATCGTGATTGTGGATTAGATTCTCAGGGA 
TGCATTCACTAAGTAAAAGTGATAATAGCTACTTTTAAGTAAAATAATGA 
ATGAATCAAACACTCTAAATCCATGGTGCTATGCTAAGCTCTTTCTGTAT 
TTTATCTCATTTGATATTACAAATATTTGATGTGTTAATAGTAATGACTA 
TCTCCATTTTTACAAGTAAGGAAACTGACATTGAGAGATTAAAAGACTAG 
CACAAATCACAAAGTAAATGAGATTTGAATCCGGTCTTGATTCCAAACTC 
TACAGTATTCTAAATTCAAGGAGACTAAATTATAAGATGGAGAGCCAATT 
TTACTTTATAACAGGGTTAGAATGGCAGAAGAGACCTGACATTCACACCT 
CTAGCCAGTGCATCATCTTCCTGTAGGCAAATATGCAGGAAATCTATAAT 
AAGAACGTCCTTTGGTGAAGGCCAGGTGCAGGGGCTTACACTTGTAATTC 
CAGCACTTTGGGAGGTCAAGGTGGGAGGGTCGCTTGATGACAGGAGTTTG 
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AGWVCAGCCTGGGCAACATAGTGAGAC^TGTCT 

ACACAAAACAACTTCAAGAAAACTCCTTTGGTATGGATCAGAACAAGATG 

AATTATCTATCTGATCCAAATGCTTAATGACATTAAGCCACAGTCCACTC 

ACTGCCACAATAGAGATATACCTGCCAATGCCACTCAGGTAATCCCATCA 

AAAGTGGTAATGAGGTCTGCAGCATGACTTGTTCTTAGTGATCCCAGCCT 

GAGACCTTGAGATTGCAGCATTTTATTCTACATATGCACAAAACATCTGT 

TG^AAAATCTTCTAAATTGATGCAATACATTCGTATCAAGAATACCTGTC 

TGTAATCTCCATAAACCCTCTCCTTTCTGTTTTAAAAAATAGTAACAGCA 

TTTCTCCTTACATGACAAAGAAATGACTTCACCATCTACGAAATAGTGAA 

TAGGAGCTGTGTGGAAGGAAATTAGCTCTACTTCTTGGTGGAGATGAGAA 

GGGAGTGTTCCTCTGAAAATCAAGGCTCTTGTCATGCTAGGAGCCAAAGT 

CGTTTTTTAGAGTGTGGACAGTTGAGAAGATAAGACAGGGACCATCCACT 

CATGTTTTTCTTATTCCATAGGCCTCTCTCAATTGGGCAAAGCACTCCAG 

ACCTTTTGGAAGAGTGACACCAAAGGCAAGCACCTGCTTGGCAGGCCCCT 

CAGCTTCTACGCAAGTATAAGTGAGTATATAAAATGGGGGTACTTGTGCT 

GTTGAGTACCTTATTTCCAAATGAGGCCTGCCGGTGTCCCTGTGGCTGTG 

AGAAGGCCTCTACTGGATAGGTGGAAGTTGTGTGTTCTCATCTTTTCTAA 

CCCTGGATTGACTTGCCCAAAAGGAAGCCATTATTAACACTATAATAAAA 

CCATCCTTAATCTGGGACTCTCTTCATGCAGTGGTTCTTAACCAGTGATA 

AACATGAGAGTTACTTTTGGAGCTTAAAAAAATTAAGATGCTCAAGGTCT 

ACCCAAACTGACTGAATCTCCAGAGGTGAGGCCCAGGGATGTATACTTTT 

GAGCCAGACCTCAGTTTACCCTGCAGAGCTCATAAGGTTGCATAACACCC 

TTTGTCAGCCACTCTGATGAAAAGAAAAATTGGTGAGGAATAAGTTTTAG 

AGAAGAAGGAGCAAAGGTGTTCTTGGCCAGTGAGAGCCAATGACAGGGAA 

ATGCAAACAATGTATCCACAAGAAAGGTAAATTACCCTATAGAGCATTTT 

AGGATAAATGAACATCTCATGCCTAGGGTTGAGAGAGGGTACAAAAAAAA 

AAAAAAAAAAGACCACTCTGGATACACAACGCGATAAATGGAATAAAGAA 

TTTTTTCCTTGTAAATTAAAAAAATCCTTTGTTACTGAGGTATAATTTAA 

TCTATTTTATGTATAGTTCAATGAGGTGTTATAGATAATAAATTTTTTTT 

GTAAATTATTATATTGTCATATACTCATACATTCATTTTTAAAAGTCAGA 

AATGTATATAACCATTAAACTTATAAATCATTCAGTCATTCAGAGATATA 

GATACACGAGCATATTTTATATCCACCACAATAATTATTACCATCTCAAC 

AATTCCATCACCCCTCAAATTTCAAGCGTAGGGGTTTTTAAATGTCAAAG 

GAGTCTACTCAGTGGGAAGAAAGTTAAGGAAAAAACCTTTGGGGCTTTGG 

GCTCCTTCCCCCTGGGGTTAAAAAGGCAGGAAATTGGGCTTACCCCCCCT 

GAAATTGGGAACTGAAATTTTGGGAAGTTTAAAAAAAAAAAAA 

>Contia22 

TCAAGCAGCCTTCCTTCCTTGGCTTCCCAAATTGTTGGGATTACAGGCAT 
GAGTCAGGA7TCCTGGCTTAGTTTACATTTTCTAGAGTTTTGTATAAATG 
GAAACATACAGAATGTATTTTTTTGCGGAGTGGGGGAGTGTTTCTATTTC 
TTTCTTTCCATTTTCCCCCCCCCNCCCCCCCGAGACGGAGTCTCGCTCTG 
TCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCGCAAGC 
TCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCT 

GGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGA 
CCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACAGGCGT 
GAGCCACCGTGCCCGGCCCAAGTGTTTCTATTTCTTAACCAGCTTTCATG 
CAATCTTTTTTTATTTTACCATCTCTGTGATCCCACTCCCAAAGGTACTA 
GATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCTTTCCA 
GCCTTCCAAAAATTTTTTTCTTTTTTTCTTAAAGATACTCCTGTGTGAGG 
CTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGTGAATG 
CCAGGGAATTGCCTGATTGATGAAAGAAATGTATCCCCTTCTCCCTCACT 
CTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGAATTTA 
AATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAAGTACA 
GAATTAGGTGCAAGAGTGCATAGCTGTTATTTTTTTTTTGGCCTCTGAGA 
CTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACCTGAGA 
TGTCAGGGGGGTCTGATAGAGTACGTTTGAAGGCAGTTACTGGAAAAAAA 
TAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAATATAT 
TGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCTTTCTTCT 
TCCTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAACTTCT 

FIG. 3 (10 of 52) 



NSDOCID: <WO 9906426A1 J_> 



WO 99/06426 



PCT7US98/16102 



TAGGAGGAGAGCTTGCC^JCCCGTGGL .^TGGAGAGGTCTTGGA(SA'GA . i 

AAAAGAGATGCTCCCACTCAATGCAGGATGG7GTGGAGGTAAATGGGGAT 

ACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAAAAAAGGGA 

GGGGAAAGAGGAAGGGAATTCNNANATNAATTGCTGAATACGGGGATTCC 

ATGGCCTGGATCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAAGGCA 

TCANCTGATGAGGAGCAGCCTGAACTCCGGGGAGGACCTGTTTTTGGTGG 

CCCGGAAAAAAATGCCTTCCACACACAGGGAGGCCACCCGGCTGATGGGC 

TCjGGGGTTGGACGGACAGCCCTAGGACAGGCTTGGGAAACCAGGCTCAGG 

TAGGGCCTGCGAGGTTCTCGCTGCGTCTCTTTCCTTCCTGGTCTTAGAAA 

ATAGAATCCAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGGCAG 

ATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCTCTC 

AGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAGACA 

GTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTCAAT 

GAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAATAT 

TCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACTTCC 

TGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGACAA 

AGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCAGTA 

GCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGCAGCAGAACC 

AGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACTCAGTCTCTA 

AGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGAAATAATTCA 

TTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGACTCTCTTTA 

CCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATCTCACTGTCAAAAA 

AGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCATTAGTCACA 

AGACATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGACTGTGTGCT 

AAACCCGACCTGCCACTTCCAAGGAGTAGATGAGGAATGTCCATGGTTCT 

GGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTCATTTTCTGT 

GGAGGGGGTTGATGGTTAAAGGAACGGCTGGGATTTACTCTTCTTTCTAG 

GGCCAAGAAAATGACATGCTGCCTCCATGTTTAATCATCCTTCCCCCTGT 

TAATAACTATGGCTTTAAGTCCCCGGTTAGGGCCTTCCTCCAAAATTGGG 

GAAAAAAATTCCCCTCCCCCCCTAAAAATTTTTTTTTTAAAAAAACCTTT 

TTTTTTGGGGGTTGGGAAAAAAACCAAAAATTTTTTTTCCCCAGGGGTTT 

TTTAATTTAAATTTCTCCCCAAAAATTTGTTTTTTTTTTTCCGCGAAAAA 

AAGACCCCCCCAAAAAAAAAAAGTTTTTTGGCGGAAAAAAAAATATTTTT 

TTTGTGTTAAGAAATGGAGAAGAAGGGGGGTTTTTTTTTTCTTCTCCCCC 

CACCCGCCAAAGGAAAGGTTGTTCACAGATTGTTTTGTGTCTCCCGCCCA 

T 

>Concia23 

ATGTGCCTGCGAAATCATCCTTCCAGAAATATTTGCCCCTTTCTTTTGTT 

ATAGAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTG 

AACACTTGAAATTGGCTTGTCAGAATTGCAGTGTAAAGTGTAAAACACAT 

ACCAAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCATT 

AACAATTTTTATATTGACTGTGTAAGTAACATTTTGAATATATTGGATTA 

AATACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACT 

TCACATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCT 

ATTAATGTCGTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGAC 

ATATAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTG 

CTGCATGTTGTTTCACTTAAAGTCAGTGTCCAAGAAACTATCAGTGACAT 

TTAGTGAGGAATTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTAC 

TTAATTCCTCTAAGCCCGGTTTATATCCCTGCAAAGAGAGAAGGATAATA 

ATCACCAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATG 

AAATGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAG 

CTGC7ATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAA 

TGTGCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGAC 

CAGGCTCATCTAAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATC 

AAGGACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGG 

CACACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAG 

CTGTGGG7GGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGT 

GGTTCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAG 

AGAGGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCT 

ACTTTAGAAATATCCACATTTTTGGTAAATT-CAAATGATCCAATGATTCC 
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ATTTCCTAATGTTCAAAAw fAGCCCCAu -AACAT CTAAAT G AATEAl^E^ i 

AATAAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGC 

AACACACACACACACACACAGAAC C CAT AAGC CTT CAT C TTT C CTTGGAT 

AAACGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTT 

CCAACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTG 

GCCCTCCTGACTTATGATTTCCTGTGTGTCGGCCTGTTACCACTGCAGCT 

TAACTGAGGAAACAAGAACAAAACAGCCTCTGACCCCAAGAGACTGTTGG 

AG&CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCC 

CAGCCCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGC 

CTTATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTC 

AGGGGTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTC 

CCTGCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGT 

TAAGCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGT 

CCTCTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCA 

CGCCCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGG 

GGCCCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGC 

GCCCTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTT 

CCTCCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCAT 

AAACTGAAAATAACAAACCCAGGGTCTTTCTGTCACAGGAAAGGGTTTTT 

TTTTATAAGATTAAACAGAGATGATTCAACACACCCAGGATATAACACAT 

GGGCCATGAGTCAAGGCCAGGCATTGCTCTGGTCAGCCTGTTGTTTGGGC 

CCCCTTGGCAGGGCTCTCCCCTGAATCTTCCCCCTCTTGACTCCCCATCA 

CCACAGCACGTCCAGCTTTGGGTACAAGGCCAG7AAATGGGGAAGGGGGT 

CAGATGACATAAAG AG CC CTTT C CTGT C C C ATTG AAAT AT ATTTGG AT AA 

CAGATGGCATTTCCCCCTGTGTCTTGCCCAGGGCCCAGAGCCTCCACTTG 

CTAGAGGCAGACAGAGGATGGAGAGCCCCTTCATTAGTGGGAGGACATCA 

CAGGTGGGCAAGAAACCACAAGCTTGCACTGAGGCCCAGCCTTGAAATAG 

CAGCACCTGCCGGCACCTGTGGTCTGGGGACAGGGTCACAGGATGGAGGG 

GCCTCCTAAGCCTTTTATCTCTATGTACTAAGTACAACCCATTTTCCCAC 

CTCACAGAGCCAGATCAGCCTCTGTGAGGTCCTGGTGGCAAAAGGATAAT 

TGCCTGCCCGCCTGCCCGCGGTGGGGTGCTTGTGCTTGCATTCCTGGGAA 

GGTTGTTGGGTTACTCTGCAATAGGTCTCTCTGACCAGCTCACCCTCCTA 

CTGCAAACCTCAAACCAACTTCAAAGAAGATCCAGCACC 

>Contig24 

CGCGTAGTCTAAAGACTGAGTCTGAAGCTGTCCCTTCCTGCTATGGACTT 

CAGATTTTAGCCCACTTGAATTGCTCCATATCCTCCAAGCCATGGCCATC 

CCTTGACTCTCTGGGCTCCCAAGCACTTGCTGCCTTCATCACACAGTTTG 

AGTT AAGG C AG AAAG ACT GGTTT C CATGT ACACTTT GTGGAAG CTTT CT C 

ATTTCTTTATATAATCTCTGTCCTTTGTCTACTGCTTTAAAATCTAGAAA 

TTGTTTACAAACACAAAGGTGATCCTTTAAAAGCTCAAAGCTGATTGTGT 

CACCAATATATACCACTCTTAATGGCTTCCCATTAAACTTTGAGTAAAGA 

CTTTATGGAGCCTACATAAGGCCATGACTACCTGGCTCTTATTTTCCTCC 

TCATCCTCATCTCACCAACTCACTCTCCACTCCTATACCCCTCACTCCTT 

CCCCCTCCTCTCTCTGAGCTCCAGACTCCCAATTACCTACTTCCACCCTT 

TTTGACCCCCAGGGACTTATCTCAGCCTGGAATTTTCCCTCTTTGCTCTC 

CACTGAACTGTCCACTCCCAGTCTAAGACATGTGCTTATGTCACACGCCC 

TTACCGTGCTTATCTCAGTTTGTAATTATCTACTCATTTAGAAAAGTGTT 

GATGAAGGTCTTCACTGTCAGCTTTCAGGATAGCAGGAATCATAGCTGAT 

TTTACTTACTTAACGGGGTTTC1ATTCTTTGTAACTTTTTTTTTTTTTGAG 

ATGGAGACTCACTCTTGCCCAGGCTGGAGTGCAATGGCATGATCTCGGCT 

CACTGCAACCTCCACCTCCTGGGTTCAAGTGATTCTCCTGCTTCAGCCTC 

CCGAGTAGCTGGGATTACAGATGCCTGTCACCACGCCCAGCTAATTTTTT 

GTATTTTTTGTAAAGACGGGGTTTCATCATGTTGGCCAGGCTGGTCTCGA 

TCTCCTGACCTCAGGCGATCCACCCACCTCAGCCTCCCAAAGTGCTGTGA 

TTACAGGCATGAGCCACGGCACCCAGCCACTCCTTTTTTACTTATGGGTG 

AGAAGCCATTAGAGATCATTTCTTCTTTTCTTTCTCTCTTCACTAAGGCA 

CCAGGGTCACTAAGTAGTAGGATACTTTGAACTAGAACTCAAGAAATTGA 

GTTTTAATTTTACCTCACACTCTCATATGAATTCTCCATGTGACCTCGGG 

C CATACTT C C C CTGTACC CTGTTTC CT CTTTT AT AAAAGT AAGAGTTTAA 

ACTAGATGGTCTCCGACATGCATCCTTCTCTAACATATTCTGGAACCTTC 
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AATAAACTAAGATAAAC _*GAATAATTAAAACTTAATTTAAAAGAA€lAx - 
GGAAAGGAAGCAGTTACAT7AAGCAAAAGAGACATCTTCATGGTTGAAGA 
AGTGTATGCCCTGGTGTCTGGATCCCATTTAGGAAACTTGGTAACCTTGC 
AATCTTGGGCAGATTGCTTAATTTCTCTAGACCATGACTTCCTCTTCTGT 
AAGATG7GATAAGAACATCTACCTCACAGGTTTCATGAGAGGATTAAATG 
AGATAATGTATTATAATCCCTTGAACATGGTAGGCTGTTATGTTAAGTCC 
TTTCCTCCTTCTCTGTAGCTATCATGGAATTTAAAAACACATTATAACTA 
GAGCATGAGTTGCGACTAAAGGCTCAATTGTCTCTGCATGTGTTGGCTCA 
TGCATGCTTTATTCCTCTGAAGAGCTTTTATACCAAGTGAAAGGAAATAA 
TTGCATTTCCCTGAAAATTCACAGGAAAAAGTTATGTTTTTCTCTTCATT 
CAAGTGATTCTGTTAGACCCAACCACATGCAACAATTTTAAAGTTGCTTC 
CAAATATATTTACAAATATTTCCTGTCTTCAAGGAACAATGGCAAGACCA 
TGACTCAGGTTCACATCCGGATTCCACCACTAACCATGTACCCAATTACT 
TCAGTCACCTTCATTCAGGTCTTACATATCACAGAATAAAATCAGATTTC 
ATCAGAGGAGGTGAAGACAGGGAGAGGAGATATTTCAATCCCTTCTCCGC 

ATAGCACGTTTGAGGGGGAAAGACCCTAAGGATGATCTTTATAAGCCATC 

ACTTGGTGTTGGTGGTGATAAAAAACTCGAGTATCTTTATGCAGTGGAAA 

GAGAAGATTGGACTCGGAATCAGAAGCTTGAGTTCAAGCACTGGTTTCAT 

CAGTCTTGTGATCTTGGGTTGGTCACTTAACCTCTTCAAGGGTCCTCAGC 

TGTGAAAGAAGATAGTATCAGCTAATTCTTGTATGTGCAGTGAGGAGGCA 

GTGAGATAGTGCAGGTAAACTATAAAACAATTGTCACATGAAACGCATCA 

CAGTGATTCTTTGGACCCACAAGCTCCAATCTTATAAAACATATCCAGTC 

ACCCACCAACATAGATCATCTCACCTTGCATATCTGATTTTGTGGATCAT 

GGGGAAAAACTGCTGATTCCTAGCAAAACCCATGGCATAGGATAAGTGCA 

CAATAATTTTTTTTTCCTAAATGATTTAGATGACAGTGACTCATTAAGGG 

TTTCCTGAGGCCTCCTCAGAGTCGAGAGGTGGGTGCCTGAAGCCACCCAA 

AGTCCCTGTCACAGGATGGCTCCCAACGCACACACCACAGGCCTGCCCAG 

TATGTTCCACTATCTACCCAGTAGAGCCCTGCCCAGTACGTTCCACTGTC 

CCTTCCCTAGAAGAGGTGACTGTTGTTCACAGTCCCAGAAAAGCGGGCTC 

CCCAAAACAATGCAAGGACCCACCTCTCTCTGAACCTCACCCACCCTAGT 

TTTCCTTTAAAAATCAATTTACAAGAAGATCATGTGAAGGAAAAGGTTGG 

GTGATATTCTAACCCAAGTTAGCTGTTTCTCAACCAAGTTCTCTTTGAAA 

AATTCAACAACCACCTTTGGGGAATTATTTACAACAGAGGAGTGAGGATG 

GGACCAGGATAGGTATTGCCTATGTTGGTGGAACCAGGGTTTTTTTCCTG 

GATTACCAAAGAGATGGTATGCATTGCTCCCAGAAGCTAAATATCTTCAG 

GCTTTCAATGGTGGCCTTCACCTGAAAATGTTATCCCTGTTGAAGCTTTC 

AAGCCAGTATTTTCATAAGAACTATATTTTCTTTGGTGAACTGAGGCATT 

ATAATGATGACTATACAGGTTCTTGAGTGACTGAAGCCATCATTAGCATT 

GTCATTATTTTTGTTTAGTTGCATCTCCATAGCAGCTCACATTCACAATG 

TGCTTTGCAATTGTTCCTTAGCAATAGCCCTCACAAGATTCTCAGGAGGA 

GAGGGT7AATCCGGATTAACATTTCTGTGAAGCCTAGCGAGATTAATCGC 

>Contig2 5 

AAGAGTTTTAAAATTAAGTAAGGACGCCGGGAAACAAATCAATCCCAGCA 

AACATTTTGTTGGGATTTATCATTCAAGCAATTTTACAGTTATCCCTGTC 

AAATACATTAAGTGTTCAAAATTGGGCATAGGGGGAACAAAATAATAAAC 

CCAGCCAAAACAGAATAATCCCTGTTTGTTCAATGTTGGATAAAAAAGAC 

ATTACTATTGGTGTAAGGAAATTAGATACATCTTCCATTATTTAGTAAAA 

TTACCATAACTTCTAACTTTGTGGCTTTAGGCAGTCTAGTCCACAGGCAG 

GAAGGAGGTTTGTTTTGGCAAATGACTGTTATCATCTTCTGTTTCAAAGC 

TAAACCATAAACTAAGTTCCTCCCAAAGTTAATTCAGCATATGCCCAGGA 

ATGAACAAGGACAGCCTGGACGTTAGAAGCAAAATGGAGTCAGGTAGGTC 

AGATCTTCTTCACTGTCTCAGTGATGGCAGTTTCATAACTTTAAATGATG 

GCTATCACAGTTTTCATAAATAATCTAGATAAACAGTTAAAATAAAATAA 

TTAGGTAAATGTAGTGCGATAAATATTAGTAGACAAACTCACCATAATTT 

AGAATCTAAAGTTAAATTAAATAATAATATTTCATTATTTGGTATTTTCC 

AAGAAAAACATATTGTAGGAAACCATTCTTTTTAAAAAAAAAAGTGTCCT 

TTTAAAAAGGTGAATAATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTA 

T GT AT AAAACAAGGT AAAAGGAACAAGG AAAT AAGGGAAATGTAAAGAAA 
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ATTATAGAAATAAAGTGw fATTTTTTGGTAAG AAAGCTTAAAGRGAjSIA- *%. 

attttaggtaagaaagaatcttacctaaaattttgtgctagaataaagtg 

actggctaagaaagggatgttcaaagctatttatgacaaacccacagcca 

atatcatactgaatggg<zaaaagctggaaacattccctttgagaactggc 

acaagacaaggatgtcctctctcaccactcctattcaacatagtatcgga 

agttctggccagggcaatcaagcaagagaaagaaataaagggtattcaaa 

taggaagagaggaagtcaaattttctccgtttgcagatgcatgattgcat 

atttagaaaaccccatcatttcagccccaaaactccttaagctgataagc 

aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaggc 

attcctatacaccaataatagactaacagagagccaaatcatgagtgaac 

t c c catt cacaattgctacaaagagaat aaaatac ctgggaatacaactt 

acaatggacatgaaagaccttttcagggtgaactgcaaaccactgctcaa 

ggaaataagagaggaaacaagcaaatggaaaaagattccatgcttatgga 

taggaagaatcaatatcgtgaaaatggccatactgcccaagtaatttata 

gattcaatgctatccccatcaagctaccattgactttcttcacagaatta 

gaaaaaactaatagccaagacaatcctaagcaaaaagaacaaagctggag 

gcattgtgctacctgacttcaaactatactacaaggctgcagtaaccaaa 

acagcatggtactggtaccaaaacagatatatagaccaaaagaacagaac 

agaggcctcagatataacaccacacatctacaaccatctgatctttgaca 

aac ctaac aaaaataagcaatggggaaaataatt c c ct atttaataaatg 

atgttgggaaaactggttagccatatgctgaaaactgaaactggacccct 

tccttacaacttatacaaaaatcaactcaagatggattaaagatttaaac 

atggctgggcatggtggctcacgcctgtaatcccagcactttgggaggcc 

gagatgggtggatcatgaggtcaggagatggagaccatcctgactaacac 

agtgaaaccctgtctctactaaaaaatacaaaaaattagctgggcatggt 

ggtgggcgcctgtagtcccagctacttgggaggctgaggcaggagaatgg 

tgtgaaaccaggaggtggagcttgcagggagtggagatcacgccactgca 

ctccagcctgggcaacagagtaagactccatctcaaaaaaaaaaaaaaaa 

aaaaaaagaaggatttaaacataagacctaaaaccataaaaaccatagaa 

gaaaacct aggcaatac cattcaggac ataggcatgagcaaagacttcat 

gattagaacaccaaaagcaattgcaacaaaagccaattgacaaatgggat 

ctaattaaactgaagagcttctgcacagcaaaagaaactattgtcagagt 

gaacaggcaacctacagaataggagaaaattttttcaatctatccatctg 

acaaagggctaatatccagaat ctacaaggaatttaaacaaatttgcaag 

aaaaaaaaacccatcaa; aagtgggcaaaagatatgaacagacacatctc 

agaagaagacatttatgt ggccaacaaacatgaaaaaaagctcatcatca 

ctggtcattagagaaatgcaaattgaaaccacaatgagataccatctcat 

gccagttagaatggcgattattaaaaagtcaggaaacaacagatgctgga 

gaggatgtggagaaataggaatgcttttacactgttggtgggagtgtcag 

ttagttcaaccattgtggaagacagtgtggcaattcctcaaggatctgga 

ac cag aaataccatttgac ccagcaatcccattactgggtatatac ctaa 

aggattagaaatcattctattgtaaagacacatgcacatgtatgtttatt 

gcagcactattcacaatagcaaagacttgggaacaaccctaatgcccacc 

aatgatagactgtgtaaaaaaatgtggacgtataccccatggaatactat 

gcagc cataaaaaagaatgagttcattcttttgcacggaactggatgaag 

ctggaagccatcattctcagcaaactaacficaggaacagaaaaccaaaca 

ctgcatgttctcactcataagtgggagttgaacaatgagaacacatggac 

acagggaggggaatgtcacacaccagggcctgtcaggaggtggggggcaa 

ggggagggataacattaggaaaaatacctaatatagatgacgggttaatg 

ggtgcagcaaaccaccatggcacatgtacacctacgtaataaacctccat 

gttcttcacatgtatcccagaacgtaaagtaaaatttaaaaaagaaagaa 

agaaagaaaaggatgttcacgacaaaccagaaagtccaagcatgtcatga 

atagtctgtgtaagtcacaataagaggatttatttaaaaaaacttttata 

tgataaagttgtctataattaaagggaaattataatggtctttctagaga 

ttgggttgatgttaaaaaactacttatatattaaaaaattggttagaaca 

atgaaattttcttacggggttgattcactcttaataaattataagagact 

TAAGAATTTTTTTTTAACCCAAAGTTCAGCTTTTATTGCATCTTGCTGTT 
TTAGGTTTTCTCTCCCCTTTAAAAGGGTGGGAAATAGTAATGCCCTCCTT 
CAACTCCZTTCAGCTCATATACGTTTTTTACCCTCAGATTCTGTTTGTTG 
TGTCCTGATGCTAACAATGTTTTCTTAAAGG7CTAAAGGAAATGTTTTCT 
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T C CAACAT AAT ATT CTG1 ^CATTGCAGAAGGTCTTTTCTTTTGCCTTTTO * 

3T AACTGGCTTAACAGAT TT T A TGTTTTATTGAAATAATTTCTATG C CAT 

TATTATTAAGTTTTGGTTTGCTTAGAAAACACTGAGATTAATACAATTTT 

TTAAAAATTATGATTATTACATCCATATATCTTTATGTATGTGCTTTTAA 

AGTCCTTGTGACATTGAGTTCTAGGGCTTGACTCCTGGGTCTTAAAAGGA 

CAAGTCCTGCTAAATCTTAAATACTGACAGCAATTAAAGGCTCATCTTCA 

GGACTGGTAGAAAATGCCAATCAAAATAAACTGCATTCTTGAAACACAGA 

GCCAGAAATTAAAGCTATTCAACTCAAGGCCCAGGAACTATAGTGGAAGA 

GGTGGGTGTGTGAGATTGTAAGGGCCAATTTTGAGAGATAAAATAAGTTC 

AATTTCTCTATAAATTAATCATAATCATTGATGTCCAAGCCACACTGATG 

CAAGATCAGCATATGGGTCCTGTGTCAGATTAACAAGGTTTTCTTGAAGC 

ATTAACCTACTCCTTAATAAAGGTTATAGAGGTTATAAAAGGCTTCTGGA 

AGTTATAGCTATGGTCAAGATAAAAATTTCATAGATTGTTAATACAATTT 

TGGAAAACAAATTTAATTGGCTTCTTGCTGTTTTTATTAGGGCTTATTGT 

TTGGAAAATTAAGTCTCGTCTCTCAAAGAATGAAGGCTTTCACCTTTTTT 

TTTTTTTTTTTTAATCCTTGAGTTATCACTTTGGTCAAATGAATGACTTA 

TTTTACAATGACCTTTCATCAAGTGTTTTAAACCTTTCAAATTTGACAAA 

CTTTCCAAAATCAAACTACAAATTATGTCTTTTTATGACCTAATGAATCC 

TTTAAAATACTAGGTTCCCTAAAGTCCAAAAAAAAAAAAAAATAACATAA 

TGTGGCTTATTTGGTATAAAAATTTTACAAGAAACATTGTCAAATATAAA 

ATATTGTGTGGTTTTGTTTGGGCTGTATTTGTATAAATATGTTATTGGTA 

TGTGTTCCAAAATTATAGGAAACTCCTATAATTCTGATATGACTTGGTGT 

\CATTATCAGTAATAATTATAATTGTTATGGTAAATTATTGTGTGCCATG 

GAGGTAACAAATTTCCTCATCAAGTGTGTCTTTGACTATGGTTGCCCTAA 

AACTTTTTGCCATTCACAGACAATTGTCTTGCTTTGGTCCTCTTTAGAAG 

GTGGTTTTATAATCAGCTATAAAACTCTAACGGGTGCTCTTGAATGCAGG 

CTTAAGATAGCTTTGGAGACTGTGACATCAGAATAGAGGAAAAACTTTCA 

GTATTCATGGAGTGCTGAAATATTCATGAATATCAAGCAAAACAGGAATT 

AACTTCATAGATGGAACTAAAAGAATGCTGAAGTAATCTTTTTGACTTTT 

TTTCTTAGAATGTTGATCCTTCGTTTTGTTTTTCAGAGTCNAGGAAATTT 

TTCTGTTGAGATATTGACAGCTTTAACAATTAAGTATACTCCAGTGAACA 

CAATTTGGAGCA 

>Contig26 

ATCTAGTCATTCCCCAGCCTGACCAATTCAATGGCCCCCATCTTAGTTAA 

AATTCCTCACCCTGACAAGGCCCCATCTACGCCTCTGACCTCATGCCCTC 

CACTCTCAGTCTTGCACTCACCCTGCCACACTCAAGGGCTTCCCCAGGTT 

CCTTCTTAGATTCCACCGATAGCTCAGGGACTTTGCACATGCTACGGTCT 

CTGCCTGGCTCCTCCCCAGATCTTCTCATGCCTAGCTGCTTCTCATCAGC 

ACCCCTCAGAGACTGTCCCTGCCCCACCTCTCCAGGTTCCATACCTGCCA 

CCCTCC C C C AATCACGTAACAGTTTCTTCACAGAGCGAGTTACCATCCCA 

GTATTTCCCTAACTTATTTTTTGTGACTGGTCTGTTGCCTGTCTCCACCA 

CAAGAACATAAGCTGCATGTGAACAGGAGCCTTGTCTATCTTGTCACCCC 

AGTGGCTGTGACATAACCTGATACACATTAGATGCTCAATGATGTTTGAT 

GAATGAAGTGCTGGTAGTCCAACTGTGTTTCCTTGTCTGTGTAAGTATGT 

CTGTTGTGGTTTCCTAAGAACCTACAGCTCTCCCACTGTGACTCCTGTTC 

TATGGTCCTGATTTGCTGGACTAGAATCCTAACCTACATGCTTACTCTTA 

GTGTCCTCCCCCAGAGGCTGAATCCCAGTCCCTAAACCTCCACCAAATGG 

CTAAGACCTAGCTTCCAACCAGACAGGCCTACGCTGAGACCTCAGCACCG 

CCCTTCTGCGGTCTCATCCTTAACGCATCCTTCAGGGCCCAGCTTAAATG 

TCTCTTCTCCAAGGAAGGCTATCCTCTTTCTGCCCCTCAGTGCTCTCCAT 

GCCTCCTCTATGCCTCCATGCCTGCTTTCAACCCTGCAGAAGTGGAGAAA 

TTGCTAATCTGCTGTGTTGACACTGTGCTGGGGTGCCTTGGGCCAGGGAG 

CAGGCTGGTGGTGTGCTGATAGCCCGTGGCTGTGCCCAGGTCCATGCTCA 

CTTCCTGAGCCCCAGTGGAGTAGGCTCCCTTTCCCTTATTGCAGCACTCA 

GAGGAAGGACGTGCTTCTTAGGACAGATCTGGCCAACCTCTCCCTCGTGA 

GAGAAGGCCCAGCCATCCTCTTGCCCTCTTTCTTTCTCCTGCCCCCGAGT 

AATAAAGGTGCCTGGTCAGAGCCTTCTAGAAGGAGACCCAAACATCCACC 

ACACATTCCCAGTTCCAACCGTCATCCACATGGCTGGCTGTGCAGGTAAA 

CGCAGAGTCTGTTTCACACACCCAACCATCTAGTATTGGATGGGAGGACA 

GTAGCGTGACACTCTTCTCCAGCCTTGAGCCCTACTGTGGGCCCCACCCA 
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ACCCAGATACCAGAGG«JCCCTGTACTGGGATGCTATTGGAfGGTfG'i^€- 

AGTCATGTACAAAGTTAGCCCTTTGTTATATAGAG7TAGCTACGTACATC 

7TCCTCTGTAGGGAACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTA 

ACCTGCAAATCCTCTGCTGAGCACCCTGCACTACATACAGTGGGTAGCAT 

GTGGTAGGTGCTCAATAACTATTGACCGATAGATTGAATACAGGTAGGAT 

GGTGACACAATCTAAGATCCCAGGGGTGGGGAGACCACACGCTTGGTTAG 

GGAGACCCAAAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTA 

GTGACAGTGCAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTG 

CTATTTCATCATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTG 

TTGGGAAAAGCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTT 

CCACTCTCTCACCTCCACCCAGGCGTCCAGGTGTGGAGGACACTTGCCGG 

CTGCTTGTCTGCTGACTCATCCCTTGGTTTCACTTGGAAAACCTACCACC 

AGCTGGCCTCTTTCCAAGCATCAGCCTCCTCATTTTCTTAATCCCTTAGG 

TGTGATCTCACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATAT 

GAATAAAAATGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTA 

TTTTATAATTCCCTTTAGGAGGATCACCTAAGTGAAGACTAXTTTTACCT 

AAGAAATGTTAAAATGTAAAGACATGGTTGTAATCTGGGGATTCCTGTTA 

AAATGGCTAGCAGACAGAAGTCAGACGACAGGCTAGAAATGTGTGAAGAG 

TGGTTGCCTTTGAAAGGCGGAGTTGGTAATGATTTTCTTCCATTTTTCCA 

TGCTTTCCAATTCTCTACAAAGGCCTTAATATTACTTCGATAACCAGGAC 

CTCTGATAACCTGCCCCCACCGAGTAAAGACTTAGCTGGGAAAGTCAGCT 

TCATGTGAGGTAAAAGGAACCAGGTAATACACAATTCCCACTGCCAACTG 

TCGGGTGTGCAGGCCTGAGCTTCCTGCATGTGGGAGGAAAGAGAAAGAAG 

AGAGAAACTCCAAGATCCAAGAGATCCAGCAAGAAGGCTGGAGTCTGAGG 

ACGCAGAAAGCTGAATGGCACAGTTACCACTATTGTGCTGAGGTTCTGTG 

GCCTCTGGGTCTCTTGACAACTGGGCAAAGACCCACAGAAAACTATCTCT 

AGACCCTACCTGTGGGAGGGGAAAGTGCTTAAGATCATTTACAGGACAGC 

CACCTGGACCTCAAATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATT 

TAGTACATACCAGGfTGCTAAGCTGGGTGCTGGAGACATGACGGGGAACCC 

ATTT AC CAT GG CTTTGTTACTGTGACATTCACAT CTAGGG AAAG C CAG CA 

AAGGGGAGGGATCGAGGAGAGCTTGTTAGGCAGAGAAAATACCCAAGGGC 

AAGGGAGAAGCCAGCCTGTTCTGAGCACACACAGTGGTTCCATCTAACTG 

GGCCTCAGTGCCAQGTTGGACTGGAGATGGGGCTGAGGAGCTGTCACAGA 

GCATTCTGGACACAGATGTCACATAGTCCCTTGAGGTTAGGGTCCTTAGG 

CATGGGAGCATTGCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGA 

CAATGTGGAAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGT 

TAGGAGACTAAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGG 

GGCAGGTGCAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAGT 

AGGTAGAATCAAGGCAGCTGGTACAGATCTTTTATTACATATAAACTGGA 

ATAAGCCATCTGTTCCAAGACAAAAGAGTAGGCGGAAAACAATACAAGAC 

AGAAATGGAATTAGAACAAACCTGGGAGGAATGTGGAATTAGAGTAGAGA 

GTCCAAGACTGGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCT 

AGGTGTGCTTACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGT 

TATGGCTTAAATGTGTCCCCCAAAATGTGATGTGTTGGAAACTTGATCCC 

CAATGCAACAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATA 

AGGGTT CTG C C CTCAT AAATGAATTAATACTGTT AT CATG AGAGT AGATT 

CCTGATAAAAGGATGATCTCTGCCTCCTGCCCACAGCCCTCTTGTGCATG 

CTTTCCTGCCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCC 

TCACCAAATGGAGCTCCTTGATCTTGGACTTTCCAGCCTCCAAAACTGTA 

AGCCAAACAAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTG 

TTCTAGAAACACAAAATGGACTAAGATCATTAAATTATCATTTTTTATCA 

GACTGTTGA 

>Concig27 

AAAATATAACAGAGAGTAAGAGGAAAATTACCTTCTTTCTTTTTCCTTTC 
CCTGCCTGACCTTATTCACCTCCCATCCCAGAGCATCCATTTATTCCATT 
GATCTTTACTGACATCTATTATCTGACCTACACAATACTAGACATTAGGA 
CAATGTGGCCTGCCTCCAAGAAACTCAAATAAGCCAACTGAGATCAGAGA 
GGATTAATCACCTGCCAATGGGCACAAAGCAACAAGCTGGGAGCCAAGTC 
CCAAAATGGGGCCTGCTGCTTCCAGTTCCCCTCTCTCTGCATTGATGTCA 
GCATTATCCTTCGTCCCAGTCCTGTCTCCACTACCACTTTCCCCCTCAAA 
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CACACACACACACAACACaJCTTAGATGTTTTCTCCACTGATAAGTAdbTu'- 
ACTCAATTTGTAAGTATATAATCCAAGACCTTCTATTCCCAAGTAGAATT 
TATGTGCCTGCCTGTGCTTTTCTACCTGGATCAAGTGATGTCTACAGAGT 
AGGGCAGTAGCTTCATTCATGAACTCATTCAACAAGCATTATTCACTGAG 
AGCCTTGTATTTTTCAGGCATAGTGCCAACAGCAGTGTGGACAGTGGTGC 
ATCAAAGCCTCTAGTCTCATAGAACTTAGTCTTCTGGAGGATATGGAAAA 
CApAC AAC C CAAACAACCAACAAAAGAGCAAGATGCTGCAAAAAAAAAAA 
AAATGAATAGGGTGCTAAGATAGAGAAAAGTGGGAGAGTGCTATTTAGAC 
AAAGTGGTAAAAACAAAGCCCCTTGTGAGATGAGAGCTGCCGACAGGAGG 
GGGCGGGTCATGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGC 
GGGTCGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGT 
- CGTGGTTGTGGGTTTTTGGGTAGGACATTCAGAGGAGGGGGCGGGTCGTG 
GTTGTGGGTTTTTGGGACATTCAAAAGAGTCTGAATGCACCCAGGCCTAC 
AACTTCAAGATGGTAAAGGACAGCTCCAAGGATCAGAAGAAGCATGCTTG 
GAACTGGGGCATTTTGAGAAGGAGGAAAAATATGCAGAGACTAGTGCTTG 
CAGAGCTTGCATGTGGATTTCATTTGAGGTACAATGAAAACGCATTAATG 
GGTTTCACACAGTGCAATGGCCTGACCTCACTTATATTTCCTAAAATAGA 
AAACAGATCAGAAGGAAGGCAATAGAGAAGCAGAAAGTCCAATGAGGAGG 
TTTCACAGCAGTCATGGGGGTGGGGTAAGGAAAAGAAGTGGAAAGAAACA 
GACAGAATTGGGTTATATTTTGGAGATAGAACCAACAGAAGGAAGAGGAG 
AAACAACATTTACTGAGAAGGGAAAAAGTAGGAGAGGAATAGGTTTGGGA 
AATAAATCCTGCTGACATTGGAAACCCCAAGGAAGCCTCAAAAGTATATT 
T AC TT G CTTT AGATTTAAAAG AAT AGGAAAGAAGCAT CT CAACTTGG AAT 
TTGAAATCTATTTTTCCATAAAAGTATTGTTAAATTCTACTCATACTCAC 
AAGAAAAGTACATTCTAAAGAGTATATTGAAAGAGTTTACTGATATACTT 
AGGAATTTTGTGTGTATGTGTGTGTGTGTATGCGTGTGTGTGTGTTTAAC 
CTTCAATTGTTGACTTAAATACTGAGATAAATGTCATCTAAATGCTAAAT 
TGATTTCCCAAAGGTATGATTTGTTCACTTGGAGATCAAAATGTTTAGGG 
GGCTTAGAATCACTGTAGTGCTCAGATTTGATGCAAAATGTCTTAGGCCT 
ATGTTGAAGGCAGGACAGAAACAATGTTTCCCTCCTACCTGCCTGGATAC 
AGTAAGATACTAGTGTCACTGACAATCTTCATAACTAATTTAGATCTCTC 
TCCAATCAACTAAGGAAATCAACTCTTATTAATAGACTGGGCCACACATC 
TACTAGGCATGTAATAAATGCTTGCTGAATGAACAAATGAATGAAGAGCC 
TATAGCATCATGTTACAGCCATAGTCCTAAAGTGCTGTTTCTCATGAAGG 
CCAAATGCTAAGGGATTGAGCTTCAGTCCTTTTTCTAACATCTTGTTCTC 
TAACAGAATTCTCTTCTTTTCTTCATAGGAGATGCCTGAGATACCCAAAA 
CCATCACAGGTAGTGAGACCAACCTCCTCTTCTTCTGGGAAACTCACGGC 
ACTAAGAACTATTTCACATCAGTTGCCCATCCAAACTTGTTTATTGCCAC 
AAAGCAAGACTACTGGGTGTGCTTGGCAGGGGGGCCACCCTCTATCACTG 
ACTTTCAGATACTGGAAAACCAGGCGTAGGTCTGGAGTCTCACTTGTCTC 
ACTTGTGCAGTGTTGACAGTTCATATGTACCATGTACATGAAGAAGCTAA 
ATCCTTTACTGTTAGTCATTTGCTGAGCATGTANTGAGCCTTGTAATTCT 
AAATGAATGTTTACACTCTTTGTAAGAGTGGAACCAACACTAACATATAA 
TGTTGTTATTTAAAGAACACCCTATATTTTGCATAGTACGAATCATTTTA 
ATTATTATTCTTCATAACAATTTTAGGAGGACCAGAGCTACTGACTATGG 
CTACCAAAAAGACTCTACCCATATTACAGATGGGCAAATTAAGGCATAAG 
AAAACTAAGAAATATGCACAATAGCAGTTGAAACAAGAAGC CACAGACCT 
AGGATTTCATGATTTCATTTCAACTGTTTGCCTTCTACTTTTAAGTTGCT 
GATGAACTCTTAATCAAATAGCATAAGTTTCTGGGACCTCAGTTTTATCA 
TTTTCAAAATGGAGGGAATAATACCTAAGCCTTCCTGCCGCAACAGTTTT 
TTATGCTAATCAGGGAGGTCATTTTGGTAAAATACTTCTTGAAGCCGAGC 
CTCAAGATGAAGGCAAAGCACGAAATGTTATTTTTTAATTATTATTTATA 
TATGTATTTATAAATATATTTAAGATAATTATAATATACTATATTTATGG 
GAACCCCTTCATCCTCTGAGTGTGACCAGGCATCCTCCACAATAGCAGAC 
AGTGTTTTCTGGGATAAGTAAGTTTGATTTCATTAATACAGGGCATTTTG 
GTCCAAGTTGTGCTTATCCCATAGCCAGGAAACTCTGCATTCTAGTACTT 
GGGAGACCTGTAATCATATAATAAATGTACATTAATTACCTTGAGCCAGT 
AATTGGTCCGATCTTTGACTCTTTTGCCATTAAACTTACCTGGGCATTCT 
TGTTTCATTCAATTCCACCTGCAATCAAGTCCTACAAGCTAAAATTAGAT 
GAACTCAACTTTGACAACCATGAGACCACTGTTATCAAAACTTTCTTTTC 
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7GGAA7G7AA7CAA7G1 ~ rC77C7AGG77C7AAAAA77G7GA7^GACL-* 

TAATG7TACATTATTATCAACAATAGTGATTGATAGAGTGTTATCAGTCA 

TAACTAAATAAAGCTTGCAACAAAATTCTCTGACACATAGTTATTCATTG 

CCTTAATCATTATTTTACTGCATGGTAATTAGGGACAAATGGTAAATGTT 

TACATAAATAATTGTATTTAGTGTTACTTTATAAAATCAAACCAAGATTT 

TATATTTTTTTCTCCTCTTTGTTAGCTGCCAGTATGCATAAATGGCATTA 

AGAATGATAATATTTCCGGGTTCACTTAAAGCTCACATTACACATACACA 

AAACATGTGTTCCCATCTTTATACAAACTCACACATACAGAGCTACATTA 

AAAACAACTAATAGGCCAGGCACGGTGGCTCAGACCTGTAATCCCAGCAC 

TTTGGGAGGCCAAGGTGGGAAGATCACTTGAGGTCAGGAGTTCAAGACCA 

GCCTAGGCAACATAGTGAGATCTCATCTCTACAAAAAAAAAATGAAAAAT 

TAAAAAATGAGCTGGACATGGTAGTACACACCTGTAGTCCCAGCTACTCG 

GGAGGCTTGAGGTGGGAGGATCACTTGAGCCTGGGAGATGGAGGCTGCAG 

TGAGCCATAATCACACCATTGCACCCCAACCTGGGCAACAGAGTGAGACC 

CAGTCTCAAAAGATAAATTTTTAAAAATGTTAAAAAATATATAAAAGAGA 

ATTTTAAAAGAACAACTAATAGATCAAAGCATGGATGCAAGATATATTTA 

GTTGGAAAATCAAGGTTAAAATCAAGGGATCTTGGAATTAGGTGTGGTAG 

ATTTGGGTAAGGAGTAGTCTAAGATGACCCTGTTTCTTGGTACTGGAGAC 

TGGATGAGTGGCAGCGTCTTAACCATATTTTTGGTAGAAATATGGAGGTC 

TTCTCCATTCCAGGATGAATGATGAGTAAAATTTTAGGCATGTAATTTGA 

GCTACTAGAAGGACACTCAATTGCAGATGTACAATGGGGAGATGATAACC 

TATCTGGAACTCAGAAAAATAACTGTATATAGATATGAAAGACATCAGTA 

GG7A7G7AG7AGA7AAAA7CC7AAAAG7GA7G7CAAAGGGAGAAGAGAAG 

TATATGG7GAACACTGTTGTTTGTCCATGCAATTGCCATCTCTTCTTCTT 

CC77AC7GACAGAACCC7GA777CAC7GAGAAG7CAACA7GCCC77CCCC 

AA77GA7GAA7CCAA77GG77GAAGA77A7G77CA77C7A77C77ACA7G 

AC7AAG7CACG77GAC77AA7CC7A7CAAA7GAGA7G7CGA7C7GGAAAC 

AAC77C7GGAAAAGA777TC7ACC77G A7AAAA7AAAG AGC CA7A7AGA7 

GG7CC777A7C77CC77CT7CC77GAA7GAGA7A7G77C7A7GAGGAAG7 

GAAGC77AGAAC7G7GG7CAGCAAC77GCAACGAC7GGGAAG7CAGAGCC 

ACACAA7GAAGAA7GCAGAG7GGAAGGAGAAAAAGAGCCAGCA7C7C7GA 

CAACA77G77 A CAC C GAGAAC C7 AC C 7 C CAG A7777 AAGAAAA CAAG AAA 

7GC7AC7G77A77AAGCCA777CAC7GGG777GC7A7GAC77GCAG7CAA 

ATCTAGCTTAACTGATACAGAGCACCACAGAGAACTGGTCTCTCATTTGT 

C7CA7CC7G77C777CTAGCAGCCACGAC777CC7AGGG777CC77AGCC 

CAAG7C7GGC7AGAGCAAGAC7AAG7AAGAC77GA77CC77AA7G7CC77 

77G7777AAGAAA7A77AAAGAA77A77777A7A77AA7A7A7777AAGA 

AA7AAGGAAA7ACAAAACAC7GAGCAAGCAACACAAA77CAAGAAA7C77 

AAAAAG7A7AATAGCTGCTCAGTCTCTGATTAACAGTGAAATATGGAATC 

ATTG7AGAAATGGCCTTGGAGCGTTATTCTCCCAGGCCAGCTATCCTTAT 

GG7C7GCCCCACC7CCC7CA77GCC7AAACAG7AAGAGAG7CCCA7GG7G 

AGAC7CAACAG7C77AGCACAGAAC77G77ACAG7C7A777C7777C77A 

CAG7CC7A7A7A7CAAT7CCAAA7CAA7GAGAG7AAAGCCCAA7CCC7GC 

C7TTAAACCCAAAGGACAGAAGCCCAAAGCCCAAAGA7A77CCC7AACC7 

7C7CCCCC7 

>Contig28 

CC7G7CGC7CCC7A7GTT7AAAGC7GGGGA7C7C77777CC7G7G7C7AA 
77A7777CC7CA7TGGC7TGAAAAA7C7GA7AAAACA7777AGGAC7G7G 
7A7AAAA7AGAA77AGCCAAG7GCAA7G7C777A77CAGAAGAAA777CA 
7GGACG77G7GCC7ACTC7C77GGC77CC7GGC77CA7GGC7T7CCAGA7 
CCCACAG7AAGC7CTGGA7AG7AGAAG77A7AG7AAGAC7GAC77C7AAA 
7AAA7GAAG7GAC777AACC77AC7GA7A7GGC77AAAGAAAAGGAG7GG 
CC777AAGA7CCA7GAAC77C7CAAACAAAAG7GA7AACG77A7C7CCA7 
GCA7A7A7AA7AC7AAA7A7AA7GCAAC7GAGAGAAG7AGGC7G7GG7AA 
GAAAGGAGACCCAAG7GCCA7C7GAAGGCAGCAC77ACCAC7C7GC77CA 
7CCCACCGAGGAAACAAAGCA7GAG7A77GCCAGA7777C77C7G777CA 
AGAAAAGCCAGAAA7CCAGG77777GCG7GAAA7G7CC7GA7777AA7G7 
7GGGAAC7AA777A7A7777GAAA7AACA77G7G7GGGACAAG7GAAC77 
3TATG7GGAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCG7TGATAC 
TCAGCAAG77CAGCCAAG7GCGCC77G7CA77G7CAG7CA7CAAGG7GA7 
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GTGTGATTGGTCAAACA^XTAGTTTTGCTCAGCATCTCGTGTGTTTTCAA 1 

AGGACCTGAGGGTTCATTTGCCCATGCAGATCTTGTAGTCCTGTTTATTC 

TATTAATTTATCTTGCAAATCTATAATGTTTTATTTTAAGCAGCGAGAGC 

CGTGGCAGCCTTTGGTCTGGACCCTTTCTAATGATCATTTAGTATCAGGC 

TATGTGGGAGTTGATTGTTTTGCATTGCCTGAAAGCCAACAGTATCACTC 

CTCCTCTAGGTGTGGCAGAGATGTGAGAGAGGGAGACTGACAGTCTGTGG 

GXPTGTATGCAGTGTTGGGGGAAGCGAGGCACAGGGGACAATAC'TGTGGT 

GTATAAAACTAGTCTAAGGTAGCATCAGGAAGTTCATGAAGCCAAAATGA 

TTTTCATAACAGCACAAGACATTATTTGTTTTTGCCTCCCTCTCATTTTT 

TTTTTTTTTTGAGACAGAGTCTTGCTCTGTCATCCATGCTCGTGTGCAGT 

GGTGCAATCTCGGCTCACTGCAACCTCCACCTCCAGGGTTCAAGCAATTC 

TCATGCCTCAGCCTCCTGAGTAGCTGATTACAGGTCTGCACCACCCCGCC 

GGCTAGT7TTTGTATTTTTAGTAGAGATGGGGTTTTGTAATGTTGGCCAG 

GCTGC C CTGTCATTTTTTTTTTACTAGTGTCCAGTGGAGTTTTTTAGGGG 

CTACATAACATGATACTGTCATTAATCTAATGGCTAATGAAAGGGATATG 

TATATGTTTTTGTGTTTAAAACAAACTTCTTTGGGGTCCTCAATAATTTT 

TAAGAGTATAAAGGGGTCCTGAGATCAAAGAGTTTGAGTTCTGCTGGACT 

GGGACAGTGGTTGTCAACCCAGATTGTACATTAGGGTCATCTGGGAAGCT 

TTAAAATAGTACTGATGCCCAACCTTACCGCAAACCAATTAAGCCAGAAT 

CTCTGTGGATGAGAAGTCTTCATTGTCATCATCACCATGACCATCATCAT 

TGTCACCGTCACTACACCATTATCATCATCATCATATCATCTTCATTATC 

ATTGTTAGTATCTCCATCACCATCATCAGCATCACCATTATTATCATCAT 

CATCATCCCCACCATCATCCTCATCGGAACTTCACCTGCATGGAGGACAA 

7CCACTATGCATTAGGTGCTATGCTATTTGCTATACTCCTTATTCTCACA 

ACTGCCCAGAGAGGCTGATATTATCTCACTTTATAACAGGAGGAATCTGG 

ATCGGAAAAGTTAAGGTAAGCTAATTCACAGAGCGAGAAGAGATAGAGCC 

AGGATTCGAAACCAGTTCTCTGCTACATCAATGTTCCCAGTCCTTGCACT 

ATTG AGAAC CTCTTTAGTTATGCTTT CAC C CCTC CAACAC CACAGTAAAT 

TTTTTCTTTTTTTAAAAAAATTATACTTTAAGTTATAGGGTATATGTGCA 

TAATGTGCAGGTTTGTTACATATGTATACATGTGCCATGTTGGTGTGCTG 

CACTCATTAACTCGTCATTTACATTAGGTATATCTTCTAATGCTATCCCT 

CCCCGCTCTCCCCACCCCATGACAGGCCCTGGTGTGTGATGTTCCCCACC 

CTGTGTCCAAGTGTTCTCATTGTTCAGTTCCCACCTATGAGTGAGAACAT 

GTGGTGTTTGGTTTTCTGTCCTTGTGATAGTTTGCTCAGAATGATGGTTT 

CCAGCTTCATCCACGTCCCTACAAAGGATATGAACTCATCCTTTTTTATG 

GCTGCA7AGTATTCCATGGTGTATGTGTGCCACATTTTCTTAATCCAGTC 

TATCATTGCTGGACATTTGGGTTGGTTCCAAGTCTTTGCTATTGTGAATA 

GTGCCACAGTGAACATTCATGTGCATGTGTCTTTATAGCAGCATGATTTA 

TAATCCTTTGGGTATATACCCAGTAATGGGATGGCTGGGTCAAATGGTAT 

TTCTAGTTCTAGATCCTTGAGGAATTGCCACACTGTCTACCACAATGGTT 

GAATTAGTTTATAGCCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCA 

CATCCTCTCCAGCACCTGTTGTTTCGTGACTTTTTAGTGATTGCCATTCT 

AACTGGCACCACAGTAAATTTTTATAGATTTTATAAGCAAATTGTATTTA 

CTGTGCAAGAATTGGTTTATTTTTTAAACCATGTGTTGCAAACATACAAT 

GGTTAATTGTGATATTTGCTCAGTACAAGATCATCAGATCACTACACAGA 

CTTGAGGTAATTCCACCTAAAAGCAAAGAGAACTGACCCCACATTAACTG 

AGAAGTCTTTACTTATTTATTCCCTATAAACGAGCCAATATGAAGAGAAG 

GCCTTAATGTGGTTAACTATGTAATTTTTTTCTGACTTTTTGAAATACTG 

AGAAGAGCTCATGACTCTCCCATCTCCTAATTCTACCTTGGTGGATTTTA 

GACTGACCACAACTCATGGGTAAATGAGGGAAGACGAATAAGAAACCTTG 

CTTTTTTTTCCTCCTTGTTTTTGGCTGGCTGCAGTGGCTCACACCTGTAA 

TCTCATCACTTTGGGAGGCCAAGGTGGGAAGATCACTTGAGCTCAGGATT 

TCAAAACTGGCCTGGGCAACATAGTGAGACCCCATCTCTAAAAAAAAAAA 

AAAAAAAAAAAAAGGCGACAGGCGGTGCGTGCCTGTAATCCTACCTACTC 

AAGAAGCCGAGGTGGAAAGATCACTTGAGCATGGGAGGTCAAAGCTGCAG 

TGAACCTTGATTGCACCACTTCATTCCAGCCTGGGTGACAAAGCAGGACG 

CTGCCTCAAGAAAACAAAAACAAAACCTTAATTTTTTGGCTATTCTTTTC 

TGGTAAGAATGGTATAGAGATGGGGATGAGGATGGCTATTGTATGAGAGA 

GCAAACAGGGTCCAAGCAGTGCTCTGGGCTGTCTAAGGACCAGTAGTCAG 

CTTAACTTCTCAAATTTCCAGGGAAGGAGTTCGGAGTGGTAGAATATCCT 
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GGGTATGCCCAAAGCATLaCCTTGCAAATAGCCTGTCATGAATJ^TTTGx 

TTCATTTGTTATGACTGGAAACTGGCTTTGTGTATGCCAGAGAATGGGGG 

CAGGAAAGAGAGATTGGTGTCTTGAGCTCTCTGTGCCTCTGGGGCAGTGA 

TGCTTTTCCTCTCATGTGGAAGGAGAGCATGACTGAAAAGGTGCACAAAT 

AAGGTGTCTGTGAGAGAAATTAACCTTCCAGATACAGAGACACAACCTTC 

CCCAAGAGGTCCTCATTGCTCTGCCTTTTTTCCTTTTTTTTGCTTGTTCT 

AQCATTAATAACAGAAACTGATTATGACCTCAAAAGAGAGGAGAAAGCGA 

CTCTC C CCACCCTAGAGCTAGTTAACCACCATATCTTCCTAGATATC CTT 

GAGAGCAATGTAACCC 

>Concia29 

GTGAACTCGTTTTACCTGTGTAGCAGACCAAGCCGCAGACAAAATCCNTC 

AGACACCAAATTAAAGAAGGAAGGGCTTTATTGGGCCTGGAGCTGCGGCA 

AGACTCACGTCTCCAACAACCGAGCTCCCCGAGTGTGCAATTCCTGTCCC 

TTTTAAGGGCTCACAACTCTAAGGCGGTCCACATGAGAGAGTCGTGATAG 

ATTGAGCAAGCAGGGGGTATGTGACTGGGGGCTGCATGCACCTGTAGTTA 

GAATGGAACAGAACATGACAGGGATCTTCACAGTGCTTTTCTTATGCAAA 

T AAC CGATT AGAT CAGGGGTGGAT CTTTAC CAGGC C CAGGGTGTGTCAC C 

GGGCTGTCTGCTTGTGGATTTCATTTCTGCCTTTTAGTTATTACTTCTTT 

CTTTGGAGGCAGAAATTGGGCATAAGACAATATGAGGGGTGGTCTCCTCT 

CTTACCTGCGGGGAGTGAGCTCAAACTCCTTAAAGGAGTTACCTGCCTTC 

CATCATCAGGGAAGCAGGAAATCTTGCCTTCCTTGTTGGAAGCAAGTAAA 

ACTCAAAACAAACAAAGAAAAAAACAGGGAGTTGTACAGCAAAATAAACT 

TTTGATTTTGACCAAATTTTGGGAGATCAGGAATTCTCTGAAGGAGATGC 

TTTCAGACCTCAGCAAATTGTCCTGTTGGTTTGAGCCATAAAGTTAGCTC 

ATGCTGGTACCAAACACCAGTAGGAGATTTGTCAAAGGTAAGAGGCATCT 

CCACTCAGAATCCCTTCGTGGTTACCAACATGTGAACCTTGGAAATCTGA 

GACAGGTCTCAGTTAATTTAGAAAGTTTATTTTGCCACGGTTGAGGACAC 

CCACCCATGACAGAGCATCAGGAGGTCCTGACCACATGTGCTCAGGGTGG 

TCTGAGCACAGCTTGGTTTTACACATTTTAGGGAGACATGAGACATCAGT 

GAATATATGTAAGATGTACACTGGTTCCCTCCAGAAAGGCAGAACAACTT 

GAAGCAGGGAGGGAGCTTCCAGGTCACAGGTAGGTGAGAGACAAACAATT 

GCATTCTTCTGAGTGTCTGATTAGCCTTTCCAAAGGAGGCAATCAGATAT 

GCATTTATCACAGTGAGCAGAGGGGTGACTTTGAATAGAATGGGAGGCAG 

GTTTGCCCTAAGCAGTTCCCAGCTTGACTTTTCCCTTTAGCTTAGTGATT 

TGGAGGCCCCAAGATTTATTTTCCTTCTACATCACTGTGGGCAGCTGACT 

AGGAAAGCTTTGTAGGACTGGTGGGCAGTGTGAGAGCCCAGTGGGGGGTG 

GTGGTCCTGTGCCAATGGTAGCAACCACCTGTGAGGCTGAGTAAACTCAT 

TTCCCAACCTCCTCTAGCAGCCCCAGTGGAGATACAGAGGAAGCAGACTA 

GCGATACAACCCAGCCTGAAGTTTTGTCTGGTGAGTGTAATGGAATAAAA 

ATGGGAAGGGTGCTGAAGAGACCAGCAAGAAAATGGTTGAAGAGATGGGG 

CACAGAAATTAAGCTGGATCAAAAAGGACGGAAAAGCAGAAAGGGCCGAT 

AGAGAGAGGGGATATCTATGGGTTCGCGATTCTGAAAAGGACAAATCACT 

GGTGCTTTGAGAAGAGAGAGGGTGAGAAAGCAGGAAGGCTGGAGGCTGTC 

ATCCAAGAGGCGGACATCTGTGAACATGATTCCAAGAGTCACCAGACCAT 

GGGGGTGGCCAAAGGGAGTGCCTCTTCTCACCTCCTACTCTTAATTCCTT 

GTACTCAAGATAATAAGTTCCCAGAAGAGAAGTACCCATATTTAATTCAT 

CTGTGTCTTC CTAGCAGTACTAAAAATATTAT ATGAAAGGTATCAAAC CT 

TTGAGAATGTGTGCTGCTAAATTGTTAAGGATGCTGGAAAACTCAAGACG 

TCCCTGATCCTGAGCCTGAGTATGAGCCTGTGGTGAGCCCAATGCAGGTC 

TCCATTCAGACAAAGGCCTCAGGGAACGGATGAGACCTAGGGACAGAGAT 

GCATGCTGGAGCAGCATTCCCCATCCCTACTGCAGCTCAGGCCAGCTGAC 

TGCTTTATGAGTAAACGTTACCAGGGAACACTTTGCAGTCTTAACACACA 

TGCCCACCTGTGACCACTGATCCCTGTTGGGTGACCACTGACATCAGAGA 

TTCGATGGCAGCAATGAAGACAAGGCTATCCTCATTAGGAAGGAAAGGAA 

GGAGGAGGGAGGAGGGCAAACGAATCTTTCCTGCTTGTCAACCACGTCCA 

TCTCTGTTAGG7GATTTCCCATGTGTGACTTTGTTTATCTTTATAATAAC 

TCTGAGAGGTAGGTCTTGATGTCCACATTTTGAACATGAGGACATCCAGC 

CAGGAAGTTGAGTTCTGGGGACATAGCTGAGAGGGCAAAGCTACATATAA 

ACCCCTCTTTGTTTTTTCTGGCTTATCCACTGAGTGCCCCCTGCAATCCA 

C CAGC C CA TTTGTGAAGTGCATACTAT AGGTAAGTTGGCACAGGAGGAGT 
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GGATGTGGGCGATTTTG * CACAGCTCTCCAGGAACTTACACACTGGTGAG' 

GAGGGCCAGG7ATGTTCCTGACCAGTCACAATCAAAGCAACCTCCTACTA 

ATCAGGGAGGCTTGGTACCTGGGGAATGCTATGTTGAAAGGTTCTTTTCT 

GGGTTTTAAAATGATGGGTCTATTTCCTTATTCTTAAGATTGCTTTTTTT 

CTGGCTAGAACTTAAAAGAAATTTTCAGTAAAATTTCCCTTCCCTGGCAC 

AAAGTGAGCTTGAAATGAATTCCCAGGTGGCCTTGATACTTTAAAATATT 

GCCTCCTATAAAATCAACCTTTAGAAGAAGGAAGTCAAAGAACATGCTAG 

ATTTCACAAAGGTTAATTCCTTGAAATCCAGTTATCTACAGGACAATGTT 

GTCAAAGAAAAAATTATTTGGCCAGGCACGGCGGCTCATGCCTATAATCC 

CAGCACTTTGGGAGGCTGAGGCAGGTGATCACCTGAGGTCAGGAGTTCGA 

G AC C AG C CT GG C CAACATGGTG AAAC C C CAT C T CT ACT AAAAAT ACAAAA 

AAAATTAGCCAGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACACGGGA 

GGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGAGGAAGTTGCAGTGAG 

CCAAGTTCAAGCCACTGCACCCCAGCCTGGGCAACAGAGCAAGACTTTGT 

CTCCAAAAAAAAAAAAAATTCAATGATATTTTTAAATTCATGGTAAGGAA 

GATTTCATTCAGAACCAGCACAGAAGATATAGGAAACACTGCAATGGGAC 

TTTGCGGTGGGGGAGAGAGATTGAACACAACTACATATACAGCACGGGCA 

AGGACATATTCATAGCCAGGAAGCAGAGCAAAGATCAGTGGATGCGAAAT 

TACTAAGAGGAAACATGAAAAATAAGGGAGCTTCTGCCTAAACCCACCTA 

ACCGGATCCTTGCTGAAGACAGGACAGGGTGATTGGACACCACTTTGGGG 

ATGGTGGAGGATGGGGAATCCAGTGAGATTTCAAGGGTGATGCGATATTG 

AACATACAAAGTTCTTGCTAAAAAAGGATTTTACAAGAAAGTGTACAAAT 

GTGCCTGGGACAAGGTGCAGGAGCCCGACGGAGATGTGGTCCAGCAGAGA 

ATATGTGC CG AGATGATAGGTGAGTTCTCTGACGAAGGATATATGCTGAT 

CCAGCCAGGGTGAAATGCTCAGAGAAAGCACGGAGGGGCTATGTCCGTTG 

CCCCAGTCTCCACGCGGTCAAATCTGATCCCGTTGTGAGTGTGGCCGTTT 

GTAGAAAGCAATCAGGGGGGGTCCCTCCCC 

>Concig3 0 

AATATATATTTTTTATANNATNTGAGACAGGTTCTCACTAGGTTGCCCAG 

GCTGGTCTTGAATTCCTGCCTTCAAGTGACTCTCCCACCTTAGCCTACTG 

CATAGCTGGGATTACAGGCACAAACCACTGCATGCAGCTAACTTTGCTTC 

TCATTCCAGCACTTTTTATTCCACTGATTATATGTATATGTATATCTGCA 

TCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATAT 

ATGGAAATATCTCTCTCTCTCTCTATATATATATATGGAAATATATATCT 

CAGTCTCTCCTATCCTCCTTTAATCAGTTTTGCTATCCTGTCAATTCCCC 

CAACGAGTGTGATGTTGTGAAATATATATTTGTTCTTCATCTCCTGTTTC 

CTGACATACAGCTTTTAAAAACCCTTGGAATCTCTGGAATAATAAGAGTG 

TCTTTTGCATGCTAATAGATGACTGCTGGCTGGCAGCCCCAATGCAGTAG 

CTTCATGATGGGGTTTGTCACAGGAAAGACCAAGGCAGGATTGGAGACTT 

GAGACTGTTAGCCCCACTCCCCAACCACTGGAGGGAGTGGAGGGGCTGAA 

GGTTGTGTCAGTCACCAATGGCCAATGGTTCGGTCAATCATGTGTATGTA 

ATAAAGCCACTCTTAAAAACCCAAAAAGGACAGGGTTTGGAAGGGCTCCC 

AGATAGCTGGACACATGAAGGTTCCTGGAGGGTGGTGCCCCAGAGGGGCA 

TGGAAGCTCCACACCCCTTCTCACATGCTTTGCTCTGCGCATCTCTTCAT 

CTGGTGTTCATCTGTATCCTTTGTAATATCTTTTAGAATAAACTGGTAAA 

CTTAAGTGTTTTCCTGAGTTCTGTGAGCTGCTCTAGCAAATTCACGGAAC 

CCGAGGGAAGCAAACCCAGATTTATAGCCATCAGTCAGAAGCATAGGTGA 

CAACCTACCACTTGTAACTGGCACCTGAAGTGGGAGGCAGTCTTGTGAGA 

CTGAGCCCTCAACCTGTGGGATCTAACGCTAACTCCAGGTAGATAGTGTT 

GGAGTGAATTAGGACACCCAACTGGTGTCGGCTGCTGGAGGACTAGTGGT 

GGG AG AAAT C C C CAAG CATTTCGGTG AC TAG AGGTCACAGAAGAACTCAG 

TGTTGAGGTGTTGTGACAGTATGGTAGGGAAAACTGCGTCTGGTTTTTTC 

CTTTTACAATCAGTTAAATATTTAACACAAGTCTACTGTATATTAGTAAA 

AGGGTTACATTTTTTAATGTCTTGACAGTTGCACTTTGACAACTTCCATA 

TCAATCACTTTTTTTCGTGTCCGTTTGGAACCAAAATCACTTGGGATACC 

ATGAACCAGGCTGCAGCGTATTCCCCAGGCCTTGAAAGCTTGGAGGCCAT 

TTTGCCAGCCNTAATCCCTGTGAATACCAGGCTTCGTGGATTTAAAAAAT 

AGACTTGAGGCCAGGCCTGGTGGCTCACACCTGTAAGCCCAGCACTTTGG 

GAGGCAGAGGCGGATAGATCACAAGGTTAGGAGTTCGAGACCAGCGTGGC 

CAACATGGTGAAACCCCGTCTCTACTAAATATACAAAAAAAAATTAGCCG 
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GGCGTGATGTTACACGv. JAGTAGTGCCAGATACTCAGGAGGCT GAGGC*G 

GAGAAATACTTGAACCTGGGAGGCAGAGGTTGAAATGAGTCAAGATCGTG 

CCACTGCACTCCAGCTTGGGCGACAGAGTGAGACTCAGTTTTCAGGGGAG 

TTAAAACAATACAAAAAAAGAAAAAGACTTGAACAATGAGGCTCCACTGG 

ATGGATTTAGGGGAATTACAGGAAGCAGGACCTGACGGTGCAATGCCACA 

CTCCACCTGTCCAGAATTGGACCTCACCAAGGGAGGTCTGTGGGGACAGG 

GAGAGGCCCTCTGCCTCCACCCCCTCCTCTACTCCCCAAACCCTGAGTCA 

GGCTGAATGTAGTAAACCTGGAACAGAAAAGTTCAGTTTGGCAATAGGTA 

TCTGAAGGACTCCAGGTGCTTCTCCCTTGATTCAAAATTTTACTTATAAA 

AAAAATTATAAGAAAATTCTACTTAAAAGAAATAATCAGGGAGGTACAAC 

AAATTG7ACTTTTTTTTTTTTTTTTTTTTTTTTGAAATGGAGTCTCACTG 

TTGCCCATGCTGGAGTACAGTAGTGTGATCTCGGCTCACTGCAACCTCCG 

CCTCCTAGGTTCAAGTGATTTTCCTACTTCAGCCTCCCAAGTAGCTGCGA 

TTACAGGTGTGTGCCACCACACCCGGCTAATTTTTGTATTTTTGGTAGAG 

ACGGGGTTTCACCATGTTAACCAAGATGGTCTCGAACTCCTGACCTCAGG 

TGACCCACCTGCCTCAGACTCCCAAAGTGTTGGGATTACAGGGGTGAGCC 

ACTAAGCCCAGCCATTGTACATATTTTGTGGGTATTTACTAAAACATTAT 

TCAAAATAGTAAAAAAAAATTGAAATAAACTGGGGACTGGTTAAATAATT 

TTGGGTACAACCACATGATGGAATACTATACAGCCATTAAAAATTACATT 

GAGGCCAGGTGTGGTGGCTCATGCTTGTAATCTTAGCACTTTGGGAGGCC 

AAAGTGGGAGGATTGCTTGGACCCAGGAGCTCAAGACCAGCTTGGGCAAT 

GTGGCAAAACCCTGTCTCTAAAAAAAAAATACAAAAAAAATTAAAAAGCT 

GGGTGTGGAGGCACACACCTCTAGTCCCAGCTACTCAAAGGGCTAAGGTG 

GGAAGATCACTTGAACCGGGGAGGTCAAGGCTGCAGTGACCCAAAATCGG 

GTCATTGCACTCCAGCCTGGGCAACAAAGCAAGACCCTGTCTCAAAAAAA 

AAAAAAATACATTGAAGAATATCTTACGGTATGGATAAATATTCATTTTA 

CAGTGATAGATGCAAATAAAAGCAAATTACAAAATATACAGTTTAATTCC 

AACTTTGATACTACATATGTATATATGAATACATGCATATGTTATGTATG 

TATATGTAAATATAACAATATATGTTCTATATATGGATATTATATATTTA 

CACATACATACACACATATATAATATCTTCTCTAGAGAGCAGAAAGAGAG 

TAGACAGATAATGAAGATAGGATACAACTCCAGTCCAGCTCAACCTAGGG 

GACTTGTTTTAAAGCCTCAGGAGAGAGAAGTTGGGACTAGAAAGCAAGGC 

AGCTATTTGTAAGCATCTTTGTGTTTCATGCTATTGGGGTGGGAAACAAC 

AGCACAACTTTTGAAAGCCCCTTTCTACTCACCCCACAAACTGCAGAGCA 

GCTTTAGGACCCTCAGAGTTCAAGAAGACCATTTGCAGAGTAGAAGAAGT 

AAAAACATGTATGAACTTGACCCTGAGCTCATGGACTGTGCCATGAGGGA 

AATTCCTAAAACAGCAGGAGAGGCCCTGGAGGAAGGCAGAGGCCCTGCAT 

ZAGCAAGTCCAGGCAAAAGCCTGCATTCCATAGATGCTCATCTCTCTGGC 

rGGTGAGGTCTAAAGACGTTTGGTCTCAATATTAAGTCTCGTGAGAGAGG 

TCACAAACCCAGTCCCTTGGCCACAAAAGGAAATAAATTCTGGCTTGAGA 

CATTAGGGAGGAACAGGGCAAGGGGAGGTTCAAGAAAGTTTTAATGGATG 

AGATGATATTTAAGCAAGGCCCTGGAAAATGAGAATTTCAACCAATAGCC 

ATATGGTAGGTCAGAAAGCAAAGATAAGGAGGGGGCAAGTGCAAGGGGCA 

ACATCAGATATGACCAGGGTGTCGTGGGGCATGGCTGATGGAGAAGAAGA 

TTAGACTGGAGTTTGGGAATGCCACAGTATCGAGGTTGGATTTAATCCTA 

TGGGTAATAAAGCCAACTGTTCAACCCCCAACCCACTTGCAATATGGCTC 

CAAAATAGCAGGTGTTTGATAAAATGACTACTTTTACTCTACTATTCCCT 

CCCTCTTAAGAAGAAAAAGAAAGTGGAGGCTCAGAGAAAGGCAGTGGCTT 

GTCCCAATCACACTATGATTTGGCCACAAAACZAAGAACGAAATGTTACAC 

CCAAAAATGCTGCCTCCACCTCCCTTCCTTGCTTTCCTCCCTGCTGGACT 

ACAGACTATCTCAAGAGTGACGTACACCATCAGGGCTTCAGCTTTTCCCC 

GAAACAATGCCAAAATATTAGCCATACGTCACTGTAGTAAGAGCCCTGAA 

TTGGGAATCCCAGCTTTGACGCAGACATGCTGATTGACTCTGTGACCATT 

CTCTTCACTTCTCCACTCTATTCTTCCCCACCTGTAAAGTGAGGTCCTTT 

CCAGTTATAAAAACAGATGATGCTATTGTCCTGTTTTGTATCTAATCTTG 

CTGTGTTATAAAAAAAAAATAAGGCTCTGTACATTCATCTTGGCCAATTC 

CCTTCTTATCTCTACTTCCCACAGCCCCTTTTTCTACAGAAAACCAGCAT 

TGTTCTTCTGGATCCATCTCTTAAGAAAGCGCTTTGCCTCCCCGGTTATT 

TAGGTGATAAGAAGTGTCCTAGATGACAGCCCTGGAATGGGCTGGAGGCA 

ACAAAAAAGCAAGTGAAATAGACAGTTACAGCGACGACAATAATAACAAC 
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CAACACCTCTCACTAAAGAGAAAGAAATAAAAAAGAAAATTAAAATCTGC 

CGCAATGCCCACACAGTCATTGAATAACTGCATGTGTACAGCACTTGGTT 

ACTTTTACATACTTCATATTTTAGCCTTCATAGCAGCTCACAGGGGTGGA 

TTTAATTTTTAGTCCAACTCCTGTCACGGTGCCTGGCACAAGTATAATAA 

ATGTTCTGTGAATAAATGACCCTCTTTTTAGATGAGGAAATCGAGGCTCA 

AGGAGAACAAGCAATGTAATGTCCCCCTCCTGTTCAGCCATCTGCCTTTC 

ACGCCACTGAATGCAGTAGTCCTCAGTGCCCTGAACTTGACCCTCTTCTG 

CTTTTCGGACTGGTCCTTCTAATCCCGTTGTGACTCACTACACCACCTCT 

CCTGCATATGACATCTACATTTTAAAACAAACCGTATGGAAATAACACAT 

TAGTCGGCTTGTTCCCCCACCCCCGCAAAAAAAAAGGCCTCTTTATAACA 

GAAACTTCTCAGGCTGGTAGGGGAATTTTATTCCCCCATTTATGGTAGAA 

AGGCCCTAACCTTGGACCTCACGCCATAGCTATTCACATGGGGGAATGAT 

GAATAACATGGGGAGCAGCATGTAAATATCATTGAGCCGTAGTCCAGACC 

TATAACACATC 

>Concig31 

GGGGGAGCTGCATGTGCCTGTCGAGATCTGGGGGAGGAACASGAAGATCA 

AGAGTTCTGTGTAGGACATGTTAAGTTGAAGGTGCTTACAGGATAGCCAG 

ATGAAGCATCAGGTGTGCAGTCAAAGATATGAGTCTGGAGCAGCACATCC 

TAAGTCACCTCCTGCACCAACACAGAACTTCCAGGCCACTCACTTGAGCT 

CTCCCAAATAGTTTCCAAGTGTCATTATGTTAATAACCTATGAGCTTGAA 

CACCAGATTCAAACCCCACTGCATGGCTTTTAAAGACCATCTCAA'GGGCT 

TGACACTCCAGGGAGCCAACTAAAGATGCCTGGTCCTACCATCAACCTCC 

ACCCCATTTTTTATAGAAAATGTTTCTACCTGTCCTAAGGCAGGGTCCTG 

CCCCACTCCCAGGCCCCTTTAGATCCCCAATATTCCTCCTCCCTGAACCA 

AAACCCTCATCATCTTCCAGCATGGGTGGGGCCTCCATTCTTGCTTCTGC 

TCCCCTGAGCAGAAGCAAGTTTCTCCCAACTTGACCTGATTCTCCTCCTA 

AGTACCAGTCACTGCTTTGTTTCTGGAATGAGAGAAAAAGACAGAGTGAG 

AGAGACAATCCAGAACTCTTGCTCACTCACAGCTAGGCTGGGCATCTGGG 

AGGATGGCTGTGTCCATGGGAACCTGGGAAAAGCCACACCCTTGGCACCC 

TGGTCACCCACCTGTCTCCCTGGCAGATTCCGCACTGCTCTCTTGCACCC 

TCTACCAGGGCTAACCGGCCTGCTCACTCTCCCCAGCATGTCTTCCCACG 

CCCACTCTCTAATTATTACATTCCCTTCACATAAACTGCCCTTCTCTCCC 

AATCACCACATGTTCACTTCCCACCCAGCTGTCAAAGTCTGGCTCAACCT 

CATTCTTGAAAAGGAAAAAACAAACAAACAAACAAACAAACAAGCAAAAA 

ACCTATGATGGATTAAGAACACACTTCATTCCAGGAACATGCTTATCTCC 

TCTAACTCTCACAACAACTACAGCAGGTAGGTGTTATCACACCCATCTCT 

CAGGTGAGAAAACAGGCTCAACGAGTGCAGGAGGACACAGCAAGTCAGTG 

ACAAAGCTTAAATTCAAGCCCAAGCCTGTTGGCAACCAACGTCTGTACCC 

TTGATAGCTACCTCATTTACCACCAAATCCAGTGGCCTCAGGCCTGGCTG 

CACACTGGGATCACCTGGTGCCCAGACCACATCTTAGACCAGTCATACAG 

AATCTCTTGGGCTGGGATCCTCCACGGTACATTTTAAGGGTCCCCAGGTG 

AGTTCCACCATGGACCCAGAATTGAGGACCCAATACCGTATACCATCTCC 

TTCTTCATCTCTTCTAAGGCATCTCTTACTCGCTGTGCACTCCCATACCA 

CTTTGTTCAATCATCCAATCATTCATTCATTGAGTCAGTTAGTCAGGAGC 

TACTCACTAGTCCCCTGCCAGGTCCTAGTCATGACATAGGGCTCTGGGGA 

CCAACAAGAAGCAGGACCCATGCCTCCTGCTCTCATGGAGCTTGCTCTGC 

AGCAGAGGAAGCAGTCAGTGAGATGTAGCAAATGTGAAATGTGCACAGAT 

GGGAAAAGCAAAACTTTAAAACTTTTAGGAC^AAATACACAAGAAATCTT 

TGCAACTTTGGGACAGGAAGGAACAACATTCCTTACACATGACACCAAAG 

GAATCAACCATAAATAAAAAGGTGATCAATTTGACCTCATTTAAGTGTTA 

AGCTTTTTTCATTGAGAGACACCATTAAAAATTAAAAATACATGCCACAA 

ACTGGGATACAATATTTACAACACTTATGTCTCACAAAGGATTAGTTTTC 

AGAATATATAAAGAACTCCCGGCCGGGTATGGCCGCGCACGCTGGAATCT 

CAGCACTTTGGGAGGCCAGCGGATCACATGAGGTCAGGAGTTCAAGACCA 

GCCTGGCCAACATGGCAAAACTCCGTCTCTACTAAAAATACAAAAATTAG 

CCAGGCA7GGTGGCGGGCGCCTGTAATCCCAGCTACTCAGGAAACTGAGG 

CAGGAGAATCACTTGAGCCCAGAAAACAGAAGTTGCAGTGAGCTGAGCTC 

ACATCACTGTAAGCCTCGGTGACAGAGTAAGACTGTCAAAAAAACGAAAA 

CAAAAACAAAAACTCCTACAAATAAATAAGAAAAAAATAGCCCAGCAGGA 

AAAAGTATATACATTTC^TAAAAGAATAAATACATTCTGTCAGTTTTCTA 
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. ACATATATTTTTTAAGAGTAAATACAAATGGTTAGGAAACATTT^TAAA 
A7GCCCAACCTCA7TAAAAATTA7AGAAG7GAAAA7TAAGCCACAATAAG 
ATACGATTTTATACCAAATACAGTGTCAACACTTTGCAAGTCTGACCTCA 
CCAAG7G77ACCAGACGTG7GCAC7GACG7GGC7GC7GAGATAC7GA7GG 
TGGGTC TGTAAATCTGTACTACAAACAATTGCAATAAAATGTAATAAATA 
7ACAAT AGG7GG AGCAGGAAG7GAC CTG CAAC CATATAGCAGAT AGGGC A 
G QAAAAAG C CT ATGAAAG C TG ACAT C AAAGGG AT AAGTT C CAG77 AC C CA 
GCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAGCATGACCTA 
7TCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAG7CAGGAGAACC7GAA 
ATTGTG7CAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACTATTGGGGGT 
TTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAAGGTCACTGG 
CACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGTCTGTTTTGG 
AAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGGTCACAAAAC 
TTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTGTGGACTGTG 
GCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAAGGCATGTGG 
GAGGCAGAAGGAATCAAATACAATGAGTGATCAGATG7GGGGTTAGAGTG 
GTGAGTGAGAAGACATACTCAAGGTGACACGCCCAGGTATCTGGGTGGAT 
GGT AAG AC ATT C ATGGACTAGGATCGAGG AANGAGGTGGGG AATGGG AC C 
ATACCTGCAGTTTATAAGGGGTGGACGAGGGAAGATTATGCGGGAGACTG 
AGAGAGGAATAGACAAAGGAATCCCGGTGCAGTATTACAGAAACTGGGGT 
GGGAGGGGGTTGTANTTCAAAAAGGAAAGAAAATTGTCAAATAGTATGAA 
ATGCTGCAGAGAAACTCACGGATTTTTTTTTTAAGCTTAGAATTATTCAT 
TGACTATGTGAATAAGAATAACTTTTATGAAAGAAGTTTTGCTTAAGTAG 
TAGGAAGAAGCAAAATTGTTGAGGGCTGATGAGTGGGAGGAGAAGTAATT 
GAAGGCACTCTTTCAAGAGAAACAAAGCAGAAGGTGAGGAGAATACTAAT 
GAAGGAGTTACGGCCTTCACTATTTTGTTTTGCTTTAGATAAGCAAGACT 
TGAGTGGGTCTGGTGAGGAGAAACAAGTAGAGTACAAAGTTAAAGGAGAG 
ACAGACAGAGATAGAGATAGGGACAGAGAGAGAGACAGAGACAGAGCACA 
AAAGAGCAAGGTCCCTGAGAACACGGGCCTTCTGTTTAAACCCCAGCCAG 
ATGTATTGCAATTCAATTCCAGTACTAACCACCCAGAGTTTGTGTAGACT 
CTACAAGTTAAAGAGCATGGTCCCCAACAAGACTGCTTCTACGTCAGATG 
CCAGGCACACTTCAGGGGTCCCCAAGCCACTCATGTTTTTTGAATGACTG 
CCATAAGTTCAAAAATTCCCACAATTCTCTCAGATTCAATAACTGGGTAT 
AACCACTCATAGAACTCAAGAAAATGCTATCATTATTATTACAATTTTAT 
TATAAAGGATACAAATCAGAAGGACTAGCCAAATGAGGAGACACATAGAG 
AGAGGACTAG7AAAAAACAGAGCTTCTGCGTCCTACCTTCAAGGAATCAG 
GATGCACCACCCTCCCAGCACATCAAGTGCTCATCAACCAGGAAGTTCCT 
CTGAGCTCCAATGTCCAGAGATTTTAGGGAGGATTCATTACATAGGTATC 
ATTGATTAAATCATTGGCCATGTACTTGAACTCAATCTCCAGTGTCCCTC 
TTC7CCCTAGAGG7CTGAAGGGTTGGCTAATA7CA7G7GGC7CAAAGCCC 
CAACTCTAATTACCTTTTTGGTCTTTTCAGGGACTAGACCCCATCCTGAA 
GCTATCTACAGGCCCTGCCATGAGTTAGCTCATTAACATAACAAAGACAC 
TTATATTACTCAGAAAATTCC AACAGT TTTAGAAGCTCCATGTCAGGAAC 
CTGGGACATAGATCAAATTCTTTTTTTTTTTTTTTTTTTGGAGACAGGGT 
CTTGCTGTGTTGCCCAGGCTAGAGTGCAAGGACAGATCACAGCTCAATGC 
AGCTTCAACTTCCCAGGCTTAAGTGACCTTTCCACCTTAACCTTCCAAGT 
ATCTGGGACCACAGAAAATGGCTAATTATCCTGGCTGATTTTTAAACTTT 
TTTTTTTTGTAGGGATGGGATCGCCCTGTGTTGCCAAGGTTGGTCTCAAA 
CTCCTGGGTTCAAGCAATCATTCTGCCCTGGCCTCTGTGATGGTTAATAC 
TGAGTGTCAACTTGATTGGATTGAAGGATACAAAGTATTATTTTTGGGTG 
TGTCTGTGAGGGTGTTGCCAAAGGAGATTACATTTGAGTCAGTGGACTGG 
GAAAGTCCACCCTTTCCCAGTGGACTGGGAGACCCACCCTCAATCCAGGT 
AAACACAATCTAATCAGCTGCCAGTGTGGTCAGAATAAAAGGAGGCAGAA 
GAACAGGGAAACACTAGACTGGCTTAGTCTTCCAGCCTACATCTTTCTCT 
GATGCTGAATGCTTCCTACCCTCGAACATCAGCCTCCAAGTTCTTCAGTT 
TTTGGACTCTTGGACCTTCAACCACAGATTGAAGACTGCAGTGTTGGCTT 
CCCTGTTTTTGAGGTTTTGGGACTCAGACTGGCTTCCTTGCTCCTCAGCT 
TGCAGATGGCCAATTGTGGGACTTTAACTTGTGATCATGTGAGTCAATAT 
TCCTTAATAAACTCAGATATATATATATGTATCAGACATATATATATATC 
CTATTGTA7ATTATATACAGATATATAATATCCTATTATATACAGATATA 
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TAATATCCTATTATATACAGGTATATATATATATATGTATCXTATATA-ri: 

TATCCTATTGGTTCTATCCCTCTTGAGAATCCTGACTAATACAGCCTCCC 

AAAATGCTGAGATTACAGGAGTGAGCCACAGCCACCATGCCCAGCCCCAA 

ATTCTTAATTATACAACAATGGGTCCAGAGATCAGGGCCTGGGTAGGATG 

CAGCAATAAGAAAACAGATGGTGGATGGGGACACATGTTGGAAGTGTGGC 

AGGACATGGCTGAGGGAACTCATAGGATGGTGTCTATTTTCATGGCTGAG 

TGTGAGGAACAGCATAAGGTCAAAATTTCAGGTCAATGGTGAGTTTTTTA 

AATTGTTGCTGTGAACCCCAAAAATCTGACCCAGGTCTCAGTTAATTTAG 

AAAGTCTATTTTTCCAAGGTTGAGAACACCCACCCACTCACGACAAGAGC 

ATCAGGAGGTCCTGACCACATGTGCCCAAGGTGGTAAGAGCACAGCTTGG 

TTTTATATATTTTAGGGAGACGTAAGTCATCAATCAATATATGTAAGATG 

TACACTGGTTCTGCCTAGAAAGGCAGGACAACTTGAAGCAGGGAGGGGGC 

TTCCATGTCACAGGTAGGTGAGAGACAAACAGTTGCATTCTTTGAGTTTC 

TGATTATCCTTTCCAAAGGAGGCAATCAGATGTGCAATTATCTCAGTGAG 

CAGAGGGATGACTTTGAATAGAAAGACAGGCAGGTTTGCCCTAAGAAGTT 

CCCAGCTTGACTTTTTCCTTTAGCTTTGTGATTTGGAGGCGCCAAGATTT 

ATTTTCCTTTCACATTTCCCCCCCTTTCTTTTTAAGAATCTTTTAAAGAA 

AGCTTTTAAAAAGAAAATGAGTCTCTGGTCCCAGGTTTCATCTGAATTCT 

CGAGGGGAGGATGGTTTATCCTAAACGGGTGGTTCTGAATTTTGAGAAAG 

TGCATTGTAC 

>Concig32 

AAAAGC CAT ACGAATGAGGAAGAATTAAGGGC CAGAACAAAACAAGAAG A 

TGAGGGAAAGT7TGGAACTTCTTAGAGACTGGCTAAATGGTTGTGACCAA 

AATGCTGATAGTGATACGGACAATGAAGTCCAGGGTGACAAAGTCTCAGA 

TGGAAATGGGGAATTTGTTGGGAACTGGGCAAAGGTCACCCTTGCTATGA 

CTCAGCAAAGAAATTGGGTGCATTGTGTTCATGTCCTGGGGATCTGTGGA 

AGTTTGAATGTAAGAGTGATGACTTACGGTAGGGTATCTAGTGGAAGAAA 

CCTCTAAGCAACAAAGTGTGTTGCTTAGAAATTTCTTTCTTTCTTTTTTT 

TTTTTTTTTGAGCTGGAGTTTTGCTGTGTCGCCCAGGCTGGAGCGCAGTG 

GCGCAATCTTGGCTCACTTCAAGCTCTGTCTCCTGGGTTCATGCCATTCT 

CCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGCGCCTGCCACCATAC 

CTGGCTAATTTTTTAGTATTTTAGTAGAGACGAGGTTTCACCATGTTAGC 

CAAGATGGTCTCAATCTTCTGACCTCGTGATCCACCCGCCTTGGCCTCCC 

AAAATGCTGGGGTTACAAGCATGAGCCACCCCGCCTGGCCTGCTTAGAAA 

TTTCTAAGCCAGGATATGGCCTGTCTGCTTCTAACAGCCTGTGCTCAGGG 

GTAAAGAAATGACTTAAAGTTGGAACCTATGTTTAAAATGGAAGTAGAGT 

CTAAAAATTTGGAAAATTTGCAGCCTGGCCTTGTGGCAGAGAAAGAATCC 

AAGTAGGCTGCAGAGCAATCATTGC7AGAGAGATTAGCATGACTAAAAGG 

G AGC C AAGTG CT AATATTCAAGACAATGTTAAAAAGGC CTTGAGGGCATT 

TCAGAGATCTATGAAGCAGCCCCTCCCATCACAGGTGCAGAGGTTTGGTG 

CACTAGGCCCAGAGGTTTTATGGGCCANNGCCAGGGCCACACTGCTATGC 

ACAGCTTTGGGACACTGCTGCCCGCATCCAGGCCACTCTGCTCTGGCTCC 

ACCCTTGGCTCAAACGGGCCAAGATAGAGCTTGGACCACTGCTCCCGAGG 

GCACAAGCCATAAGCCTTGGTGGTTTCCATGTGGTGTTAAGCCTGCAGGT 

GCCCAGAATGCAAGATTGAGGGAGCTTGGGCACTTCCACCTAAATTTCAG 

AGGATGTGTCAGAAACCCTAGGTTCCCAGGCAGAAGCATGATACAGGGGC 

AGAGCCCTTGCAGAGAACCTCTACTAGGGCAATGCCAAAGGAAAATGTGG 

GGTTGGAGTCCTCACACATGGTCCCCACTGGGGCACTACCTGGTGATACT 

GTGGGAATGGGGCTGCTGCCCTCCAGACCCCAGAATGGTAGATGCACTGG 

CAGCTGGCACCCTGAGCCTGGAAAAGCTGCAGGCACTCAACTCCAACCCA 

TGAGATCAGCCACATGGGCTACTCCCAGGGAAGCCCACAGAGGCAGGGCT 

GTCTAAGGCCTTGGGAGCCTACCCCTTGAACCAGCTTGCAGGACATGGAA 

TCAAAGATTATGTTGCAGCTTTAAGGCTTAATGTTTTCCCTGTCAATTTC 

AGGCTTGTGTGGGACCTGTTGCTTTTTTTTTTTTTTTTTTTTTTTTTGGT 

CACAGG7GTTTGAACCAGAACAATTCCATCTTGAATAGGGGCTGGGTAAA 

ATAAGGCTGAGACCTACTGAGCTGCATTCCTAGGAGGTTAGGAATTCTAA 

GTCACAGGAGGAGATAGGAGGTCGGCACAAGATACAGGTAGCGAAGACCT 

CGCTGATAAAATAAGTTGCAGTAAAGAAGCCAGCCAAAACTCACAAAGCC 

AAAATGGTGATATGGTTTGGCTCTATGTCCCCACCCAAATCTCATCTCAA 

ATTATAATTCC CATAAT C C C CACATGTTGAGGGGAGGACCTGGTTGGAGG 
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tgattggattatggaggcaatttcccccatgctgttctggtgatACtgag 

tgagttctcataagatctaatggttttataagtgtttggaagttcctcct 

acacacatgctcacactctctcctgcagctttatgaagaaggtacttgct 

ttcctttctgccatgattgtaagtttcctgaggcttcccagctatgcaga 

actgtgagtcaattaaacccgttttctttatacattaccagtcttgggca 

gttctttacagcagtgtgagaactgctggcgatgagagtgacctctggtt 

gxcctcactgctcattatatgctaattataatgtattagcatgccaaaag 

acactcccaccatgaccccaacagtcatgcctgtgccggtctcagcacca 

tgacagtttacagatggcatagcaacgtctaaaaggtaccccatatggac 

taacaaggggaggaaccctcagctctgggaagtgcctacctcgttcccag 

aaagcttgtgaataatccactgcttgtttaacatataattaagaaataac 

tattaagcatccttagttcagcagcccaagctgctgttctgcctatggag 

tagccattctttattccgttactttcttaataaaattgcttttactttac 

tgtatgtactcgcctggaattctttcttgtacgaggtccagagccctctc 

ttgggtctggatcgggacccctttctggtaacattttgaccaatttctcc 

cttctggaatgggaatgtttacacaatgactgtatcacttttgaatcttg 

gaagtaaataatttgtttttgactttacagcctcataggtggaaggaact 

tgacttgaatttcagatgagactttggactttgggacttttgggttgggg 

ctggaatgagttaaaagttggggggattattgggaaggcacgattttatt 

ttgcaatatgagaagcacatgagatttgggggaccaagggtggaataata 

tggtttggatgtttgccccctccaaatctcacattgaaatgtaatcccca 

gtgttgaagtgaggcctgctggaaaatgtttggattacaaggctgtcgag 

cacattggataagacgtgtaggnccc 

>Concig3 3 

CGCAGCTCGCTGGTTAATTCTGTGGCTCCTGTGACCACTATTATAGCACC 

AGGTCTATGACCAGGAGAATTAGACTGGCATTAAATCAGAATAAGAGATT 

TTGCACCTGCAATAGACCTTATGACACCTAACCAACCCCATTATTTACAA 

TTAAACAGGAACAGAGGGAATACTTTATCCAACTCACACAAGCTGCTTTC 

CTCCCAGATCCATGCTTTTTTGCGTTTATTATTTTTTAGAGATGGGGGCT 

TCACTATGTTGCCCACACTGGACTAAAACTCTGGGCCTCAAGTGATTGTC 

CTGCCTCAGCCTCCTGAATAGCTGGGACTACAGGGGCATGCCATCACACC 

TAGTTCATTTCCTCTATTTAAAATATACATGGCTTAAACTCCAACTGGGA 

ACCCAAAACATTCATTTGCTAAGAGTCTGGTGTTCTACCACCTGAACTAG 

GCTGGCCACAGGAATTATAAAAGCTGAGAAATTCTTTAATAATAGTAACC 

AGGCAACACCATTGAAGGCTCATATGTAAAAATCCATGCCTTCCTTTCTC 

CCAATCTCCATTCCCAAACTTAGCCACTGGCTTCTGGCTGAGGCCTTACG 

CATAC CTCC CGGGGCTTGCACACACCTTCTTCTACAGAAGACACACCTTG 

GGCATATCCTACAGAAGACCAGGCTTCTCTCTGGTCCTTGGTAGAGGGCT 

ACrTTACTGTAACAGGGCCAGGGTGGAGAATTCTCTCCTGAAGCTCCATC 

CCCTCTATAGGAAATGTGTTGACAATATTCAGAAGAGTAGGAGGATCAAG 

ACTTCTTTG7GCTCAAATACCACTGTTCTCTTCTCTACCCTGCCCTAACC 

AGGAGCTTGTCACCCCAAACTCTGAGGTGATTTATGCCTTAATCAAGCAA 

ACTTCCCTCTTCAGAAAAGATGGCTCATTTTCCCTCAAAAGTTGCCAGGA 

GCTGCCAAGTATTCTGCCAATTCACCCTGGAGCACAATCAACAAATTCAG 

CCAGAACACAACTACAGCTACTATTAGAACTATTATTATTAATAAATTCC 

TCTCCAAATCTAGCCCCTTGACTTCGGATTTCACGATTTCTCCCTTCCTC 

CTAGAAACTTGATAAGTTTCCCGCGCTTCCCTTTTTCTAAGACTACATGT 

TTGTCATCTTATAAAGCAAAGGGGTGAATAAATGAACCAAATCAATAACT 

TCTGGAATATCTGCAAACAACAATAATATCAGCTATGCCATCTTTCACTA 

TTTTAGCCAGTATCGAGTTGAATGAACATAGAAAAATACAAAACTGAATT 

CTTCCCTGTAAATTCCCCGTTTTGACGACGCACTTGTAGCCACGTAGCCA 

CGCCTACTTAAGACAATTACAAAAGGCGAAGAAGACTGACTCAGGCTTAA 

GCTGCCAGCCAGAGAGGGAGTCATTTCATTGGCGTTTGAGTCAGCAAAGG 

TATTGTCCTCACATCTCTGGCTATTAAAGTATTTTCTGTTGTTGTTTTTC 

TCTTTGGCTGTTTTCTCTCACATTGCCTTCTCTAAAGCTACAGCCTCTCC 

TTTCTTTTCTTGTCCCTCCCTGGTTTGGTATGTGACCTAGAATTACAGTC 

AGATTTCAGAAAATGATTCTCTCATTTTGCTGATAAGGACTGATTCGT7T 

TACTGAGGGACGGCAGAACTAGTTTCCTATGAGGGCATGGGTGAATACAA 

CTGAGGCTTCTCATGGGAGGGAATCTCTACTATCCAAAATTATTAGGAGA 

AAATTGAAAATTTCCAACTCTGTCTCTCTCTTACCTCTGTGTAAGGCAAA 
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7ACC77A77C77G7GG7GT7777G7AACC7C77CAAAC777CA77GA7x'G ! 

AATG C C T G 77 C 7GGCAA7ACA77 AGG77G GG CACA7AAGGAA7 AC CAAC A 

7AAATAAAACA77C7AAAAGAAG777ACGA7C7AA7AAAGGAGACAGG7A 

CATAGCAAACTAATTCAAAGGAGCTAGAAGATGGAGAAAATGCTGAATGT 

GGACTAAGTCATTCAACAAAGTTTTCAGGAAGCACAAAGAGGAGGGGCTC 

CCC7CACAGA7A7C7GGA7TAGAGGC7GGC7GAGC7GA7GG7GGC7GG7G 

T7C7C7G77GCAAAAG7CAAGA7GGCCAAAG77CCAGACA7G777GAAGA 

CC7GAAGAAC7G77ACAGG7AAGGAA7AAGA777A7C7C77G7GA777AA 

TGAGGGTTTCAAGGCTCACCAAAATCCAGCTAGGCATAACAG7GGCCAGC 

ATGGGGGCAGGCCGGCAGAGGTTGTAAAGATGTGTACTAGTCCTGAAGTC 

AGAGCAGGTTCAGAGAAGACCCAGAAAAACTAAGCATTCAGCATGTTAAA 

CTGAGATTACATTGGCAGGGAGACCGCCATTTTAGAAAAATTATTTTTGA 

GGTCTGCTGAGCCCTACATGAATATCAGCATCAACTTAGACACAGCCTCT 

GTTGAGATCACATGCCCTGATATAAGAATGGGTTTTACTGGTCCATTCTC 

AGGAAAACTTGATCTCATTCAGGAACAGGAAATGGCTCCACAGCAAGCTG 

GGCATGTGAACTCACATATGCAGGCAAATCTCACTCAGATGTAGAAGAAA 

GGTAAATGAACACAAAGATAAAATTACGGAACATATTAAACTAACATGAT 

GTTTCCATTATCTGTAGTAAATACTAACACAAACTAGGCTGTCAAAATTT 

TGCCTGGATATTTTACTAAGTATAAATTATGAAATCTGTTTTAGTGAATA 

CATGAAAGTAATGTGTAACATATAATCTATTTGGTTAAAATAAAAAGGAA 

G7GC77CAAAACC777C7777C7C7AAAGGAGC77AACA77C77CCC7GA 

AC77CAA77AAAGC7C77CAA777G77AGCCAAG7CCAA77777ACAGA7 

AAAGCACAGGTAAAGCTCAAAGCCTGTCTTGATGACTACTAATTCCAGAT 

7AG7AAGA7A7GAA77AC7C7ACC7A7G7G7A7G7G7AGAAG7CC77AAA 

TTTCAAAGATGACAGTAATGGCCATGTGTATGTGTGTGACCCACAACTAT 

CATGGTCATTAAAGTACATTGGCCAGAGACCACACTGAAATAACAACAAT 

TACATTCTCATCATCTTATTTTGACAGTGAAAATGAAGAAGACAGTTCCT 

CCATTGA7CATCTGTCTCTGAATCAGGTAAGCAAATGACTGTAATTCTCA 

TGGGACTGCTATTCTTACACAGTGGTTTCTTCATCCAAAGAGAACAGCAA 

TGACTTGAATCTTAAATACTTTTGTTTTACCCTCACTAGAGGTCCAGAGA 

CCTGTCTTTCATTATAAGTGAGACCAGCTGCCTCTCTAAACTAATAGTTG 

ATGTGCATTGGCTTCTCCCAGAACAGAGCAGAAC7ATCCCAAATCCCTGA 

GAAC7GGAG7C7CC7GGGGCAGGC77CA7CAGGA7G77AG77A7GCCA7C 

C7GAGAAAGGCCCCGCAGGCCGC77CACCAGG7G7C7G7C7CC7AA7G7G 

A7G7G77G7GG77G7C77C7C7GACACCAGCA7CAGAGG77AGAGAAAG7 

C7CCAAACA7GAAGC7GAGAGAGAGGAAGCAAGCCAG77GAAAG7GAGAA 

G7C7ACAGCCAC7CATCAA7C7G7G77A77G7G777GGAGACCACAAA7A 

3ACAC7A7AAG7AC7GCC7AG7A7G7C77CAG7AC7GGC777AAAAGC7G 

7CCCCAAAGGAG7A77TC7AAAA7A7777GAGCA77G77AAGCAGA7777 

7AACC7CC7GAGAGGGAACTAA77GGAAAGC7ACCAC7CAC7ACAA7CA7 

TG77AACC7A777AG77ACAACA7C7CA77777GAGCA7GCAAA7AAA7G 

AAAAA7C77CC7AAAAAAA7CA7C77777A7CC7GGAAGGAGGAAGGAAG 

G7GAGACAAAAGGGAGAGAGGGAGGGAAGCC7AA7GAAACACCAG77ACC 

7AAGACCAGAA7GGAGATCTTCC7CAC7ACC7C7G77GAA7ACAGCACC7 

AC7GAAAGAAC77TCAT7CCCTGACCATGAACAGCC7C7CAGC77C7GT7 

7TCC77CC7CACAGAAATCCT7C7A7CA7G7AAGN7A7GGCCCACTCCA7 

GAAGGC7GCA7GGA7CAATCTG7G7C7C7GAG7A7C7C7GAAACC7C7AA 

AACA7CCAAGC77ACCT7CAAGGAGAGCA7GG7GG7AG7AGCAACCAACG 

GGAAGG77C7GAAGAAGAGACGG77GAG777AAGCCAA7CCA7CAC7GA7 

GA7GACC7GGAGGCCA7CGCCAA7GAC7CAGAGGAAGG7AAGGGG7CAAG 

CACAA7AA7A7C777CT7TTACAG7777AAGCAAG7AGGGACAG7AGAA7 

7TAGGGGAAAA77AAACG7GGAG7CAGAA7AACAAGAAGACAACCAAGCA 

7TAG7C7 GG7AACTA7ACAGAGGAAAA77AA77777A7C C77C7CCAGGA 

GGGAGAAA7GAGCAG7GGCCTGAA7CGAGAA7AC77GC7CACAGCCA77A 

T77C77AGCCA7A77G7AAAGG7CG7G7GAC7777AGCC777CAGGAGAA 

AGCAG7AA7AAGACCAC77ACGAGC7A7G77CC7C7CA7AC7AAC7A7GC 

C7CC77GG7CA7GT7ACA7AA7C7777CG7GA77CAG777CC7C7AC7G7 

AAAA7GGAGA7AA7CAGAA7CCCCCAC7CA77GGA77G77G7AAAGA77A 

AG AG7 C7 2 AGG C777ACAGAC7G AGC7 AGC7GGGC C C7 C C7G AC7G77A7 

AAAGA77AAA7GAG7CAACA7CCCC7AAC77C7GGAC7AGAA7AA7G7C7 
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■3GTACAAAGTAAGCACC\-AATAAATGTTAGCTATTACTATCATTAlS , Au 2" 

ATTATTTTATTTTTTTTTTTTGAGATGGAGTCTCACTCTGTTGCCCAGGC 

TGGAGTGCAGTGGCGCAATCTTGGCTCACTGCAAGCTCTGCCTCCTGGGT 

TCACGCCATTTTCCTGCCTCAGCCTCCCGAGTAGCTGGGACAACAGGCAT 

GTGCCACCATGCCCAGCTAATTTTTTTGTATTTTTAGTAGAGATGGGGTT 

TCACTGTGT7AGCCAGGATGGTCTCTATTTCCTGATCTCATGATCCGCCT 

GCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCG 

GCCTATTAT7ATTATTATTACTACTACTACTACCTATATGAATACTACCA 

GCAATACTAATTTATTAATGACTGGATTATGTCTAAACCTCACAAGAATC 

CTACCTTCTCATTTTACATAAAAGGAAACTAAGCTCATTGAGATAGGTAA 

ACTGCCCAATGGCATACATCTGTAAGTGGGAGAGCCTCAAATCTAATTCA 

GTTCTACCTGAGTAAAAAAATCATGGTTTCTCCTCCATCCCTTTACTGTA 

CAAGCCTC CACATGAACTATAAACCCAATATTCCTGTTTTTAAGATAATA 

CCTAAGCAATAACGCATGTTCACCTAGAAGGTTTTAAAATGTAACACAAT 

AT AAGAAAATAAAAATCACTCATATC GTCAGTGAGAGTTTACTACTG C C A 

GCACTATGGTATGTTTCCTTAAAATCTTTGCTATACACATACCTACATGT 

GAAC AAAT ATGT CTAACATCAAGAC CACACTATTT ACAACTTTATATC CA 

GCTTTTCTGACTTAGCAATGTATTGATGACATTATGCATGCTTAGACCTC 

C 

>Concig34 

GTATTCTATTCTCGGTTATAACACAATCACAGTGATTTGTCATAT-GTTTC 

CAGGATTTGTTAATTTCACTTCTTCAGCTGTTTCCCCCTTGTTGGCTGGA 

ACTGATTTTCTATCTTCTGGGAGAATCTTCAGCAAGCCAACTCAGGATTT 

GTTGGGTGCATTTTGTCAAGTCTAGGACCCAGGCTCTGGGTGACTGATTT 

CCTCTAATTACCGAGCAATGTAAAATGAGGAAGTCTGATTGTGTAAAGGT 

GTTAAACTTTTGTGTGACGGCAAAACTTTAATACCATGAATAGAGATTCC 

AGAATTTTCCAACTTCTAACGGGATTCCTTTCACTCCCTGACATTAGAAT 

GTTAGAAAATCTACCACAAAACATCTGTGAGGCTATCCTACAAGGCCCGT 

TTTTCAAAATAGGTTTTTACAAGGATTGCTATTTGGGATGATAGTTTCAG 

AAAGGCGCTATCAAAGTTAATTGATGATGTGTGCAAGCTGAAAGTTATAT 

GTTAGAACTAGCAGTGATTTCAAAAATATCCCTTTTAGGCTTTTTGCTAA 

TATATCTGCTCATTTTCAAAGTTCCCAATATTATAAAACTTTTTAAAGCA 

GAAAGAAGAACCCTCCATTTCTGCTGGCCCCTTCCCTGTTCAACTAAAAA 

GTATTTTCCCAGGCAATGCTATCCCAGGACTCACACTCCATCCATCCATC 

ACCTACCATAAGTTCTTTGAAGGGCTCATTCTGAGCGCTTCCTGAGTGCC 

TGGGA7CTGTTATTTCTCTCCATTTCTGCTGCTGCATGGTAGTCCAAGTC 

CTCCTCCCTTTTCCCCTAGGCCATTTGAATCATCTGCTAATTGGTTTTCC 

TGATTGCCACGGAAACTTCCTCCATCCCTTCCTCACATATCAGCCACAGA 

AGTATCTG CAAAAAGCAAATCTGGTGACATGAAGCCCTTGCACAAAACC Z 

ATTCATTACTGGTTCCACACCTCCTTTGTGGATAAGTTCAAGCTCCTGAG 

TGTGGCAAGCAGGGCCCACCTGGAATCCCCTGCCCTCCTCTCCTATCCCA 

CGCATCAATCTTTCCTGTCTATTTGCAGTTCCTTGAATGTGATATTCTTT 

CTAGTCTCTGTGCTTTTGCATAACCTGTTCTTCCTGACTGGAAACTCCTT 

CTCCTCCTTGTAGTTTGGCTAATTTCTAGTCTTTCAAGACTCAGCTCATG 

CTTCACCCCCTCTATAACAAGTCCTTTCCCAAGCTGGGTGGTGGATGCTC 

CTCTGTGCTGTGTGAGTCTTGAACATCCTCAGCAAACCTCAGCTTTGTTT 

GCTTGTCTCCCTTGCTGTCAATGCACCTGATTCAGGGCTGGCATATACTG 

TTCACCTCCATGACTGGCTCATGGTGGTGCTCCGTGAATATCATCCACCC 

AAACGGATGAGAGCTACCATGCCATCACTTGTGACTTCCATCTGGAGCTA 

ACCTCCCCCGACAGGAAAGCGTTTCCTTAGGAAAGAATATCTTTGGGTTA 

AATAGAAGTAGAGACTCACCAGAAGCACTATGTCCAGCTCAGAATGAACT 

GCTCAGTAAGCAGCCTTGTCAATGAGGAGGCAGCAGGCCAGCCCCAGAGG 

CCTCAAAGTGGGAGAGTAGAGAAGCGCAGTTCCTGCCACAAAGGCACAGT 

GGACACCTTGCTCCCCTGGCTGGCTGGAAGCAGATGGTGTCCACCTGCTT 

CCATGGGAATTCTGCACCTTTAATAAAGTTTTATGGGACAGGAAGGTGAC 

TGGCATTGACATTGTAACGAGGAATGGGTGGTGCCACCTTTGCTGTGTCT 

TACCAGAAATACCTGTGGCAGGTAAATTTCTAGAGAGACCCTCCCATTTC 

TCCCATATAGCAATTTTGAAATGTTTCCTGAGGGCTTTCCAAATTCATCT 

GGGAACATAGGAGTTCCAGAAAGATGAAATCAAAGGTGATGGTATGCCAA 

AGAAAGTAGCTTTTAGAATGACTTACATTAGCCATTCATCCATTCAGCAC 
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ACCAGGCATTCAGTTTGAGGGGTGTGTGTGTGTGTGTGCGCGCGCGCG iG 

CGTGCATGAGTGCATGCGCGCGCGTGTACATAGGGGAAGGGAAACAAAAC 

AAAAGTACACAAGACATGATAGTTGTCCTCAAGGAGTTTTTGCAAATGTT 

CACAATTTAAGAGAATATGCTGTGCTGTGGCTGGTGTATAAACCAACTGC 

TAGGGAGAGGCCTTCCACACACACTTGGGGCAAATGCGACCTCTAGGACT 

GCCAGTGGAATCTGGGCATGCTGTTTGTGGTCGATAAACCCTGGTCCCTT 

GATCAGGGACCTATGTTTACTTTTCCTCTCCCTGGAAGTCTTCATTAGTG 

GGtATCCAGAAGGTCTTGCACAGGGCAGAGGGAGGCACAAAGACAAGAGT 

TTGAAACCAGCCTGGACAACAAAATGAGTTTCTATCTTTACAAAAAAAAT 

TTTTAAAAAATTAGCCAGGTAGGATTGCATGTGCCTGTAGTCCCAGCTAT 

TCAGGAAGCTGAGGCAGGAGGATTCCCTGAGACCAGGAATTTTGAGGCTG 

CAGTGAGCTATTAAGTTGGCGCAAAAGTAATCGTGGTTTTTATCATTAAA 

AGTAATGGCAAAACTTTTAATGACAAAAACCGTGATTACTTTTGCACCAA 

TTTAATATGATTGCACGACTGCACTGTGCTCCAGCCTGGGCAACAGAGTG 

GGACCCTGTCACAAAATAATAAATAAATAAAATGTAAACATGTAAAAAAA 

ACCCCAAAAACAAAAAAAATGGGTGTTGAGACCCCTGAATTGAGGAATAA 

TAGGAAGGAGTGTGATTCTGTGTGTGCATGCATGGGTGTGCACCCTCAGT 

GCCTGGGTGGCTTACCCTGGGCTAGTTCAGGTGGCAAATGGTTTTCCTCC 

AGCTGGGCTACCACCATCTTCCCCCAGGGCCTGTCCATGTATTTGGTGGC 

AAGATACCTATGGACTAGAGTCCCTCCTCAGAGGAAAGGCTCCTCCCATT 

TCTCTGGCTTTCAGGTAGTAGTCCATGACTTCAACAGGTCCCCAGTGCAA 

TGTTATGGGTTAGTTTAGGTGGGGTCTCCTCTGAGAGCCTCCCATAGCCC 

AAAAGGCCCTGTCCTAGCTGGCACTGCATCTCCCTCTTCCCAGCTCTCAG 

CCTTTCTCTTTGCTCATCCCACTCCGCACAGGCTTTCTGCCTGATCCTTG 

GATGTGTCAATCCTGCCCCTAAGGGATGCAAGGCAATTTGTCCTTTTATT 

ATTAAGATCTCTCCTGAGGCCACGTGTGGTGGCTCACACCTGTAGTCCTA 

GAACTTTGGTAGGCCAAGGTAGGAGAATTGCTTGAGCTCAGGAGTTCCAG 

GCTGTAGTGAACCATGATTGCACCATTGCATTCCAGCCTGTGTGACACAG 

CGAGACCCTGTCTTTTTTCTTTTTTTTTTTGAGACAGGGTCTCGCTCTGT 

CATCCAGGCTAGAGTGCAGCGGTGTTTTTCTGCTCACTGCAGCCTCAACC 

TGCACATTTTTTGTAGAGACGGTGTCTTGCTATGTTGCCCAGAGTGGCCT 

CAAACTCCTGGGCTCAAGAGATCTTTCCACCTCAGCCTTCCAAAGTGCTG 

GGACTACAGGCGTGAGCTACCGCGCCCAACAAAGACCCTGTCTTAAAAAG 

AAAACAAAAATAAACAACTCCCTCAAGTCTTTTTTTTTTTTTTGAGACGG 

AGTCTCGCTCTGTCGCCCAGGCTGGAGGGCAGTGGCGCAATCTTGGCTCA 

CTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCTTGCCTCAGCCTCCC 

GAGTAGCTGGGACTACAGGTGCCCGCCACCACGCCTGGCTAATATTTTGT 

ATTTTTAGTAGAGATGGGGTTTCACTGCGTTAGCCAGGATGGTCTTGATC 

TCCTCACCTTGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTAC 

AGGCATGAGCCACCGCGCCCAGCCAGACCTCTTGAGTCTTAAACTCCTCT 

GTAGTTCCAGCCACCCTTTAGCACATGACTCTGTTAATTTTGTTCTCACT 

GTCTGAAATCATCTCCTGTCCACTCTTGACTGACAGGTCTCTGCACTAGC 

CCACTGCTTAATCAGAGTAGGTCCCTGTCAACTTATTCATATTGTGTCCC 

CATGCCAGTGTGGATGATTAAAATTGTTGAGTGGAGGCTGATCAGATGAG 

CCATCTCCTTCCAAGTCCTCACTTGCTGGCTCCTGTCTTAGTTTTAGTCC 

C CATT CTT CAAAG AACGTG AG C C CTGGAAAGTATTTTAGT CATTT AGTT C 

AGTGCCTTTGGATGGGAGGATCACATCCCTGGGTCCCGTCCTGCAGACTG 

TTTTGCTCTAGCTGACTAGGCAGGATTCCCTGCCTTCTCTCACTTCGGCA 

TGGGACTTCCTTCTGAAATTGCTGCTCAGTCAAGAGAATGACCTTCCCCA 

ACATAATCCTACTCCACAGGGACTTAAAGGTGTGTCAGAGATCTCTTGCT 

CATCTTTCTGGCCAGGTGCCAACGTCAGTTTATAGCCAAGGGACAAGACT 

AGTTAGCAGATCAGGCAGGTCTTAGACCCCAGCGTAAGTGCCAGACTTCT 

AGCTGCAGTTGTTCCTGCCCACACTGGGCGTTCAGGTGGAGAGAGGGCAT 

GGCACTACACTGAGCTCTCGGCGAAACCCAGGACTCTGAAATCTCGGTGT 

CAGCCACAGGCCACTCTTTTCAGCAGGACTTCAGTCAGTCCTGTCACTAG 

GCTG7CGAGCACATGGTAGGCTTTACCCC 

>Connia3 5 

AAGGAGTGTGCTTGCTGATAGCATGTGTGANGGGACGAGGAGTAAATAAT 
TT CTG C CTT CAAGAAATTG CAAACTAGTAATGGAG ATAAAATC AAC AGAG 
G AAC AATT AG AGTAT AAGGT AAAAT CTAAG GG C C ATAAGAG AGG AGAAG A 
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AGTATGGGAGTTCZAGAGorAGGGGGTAAATGAGGGGAGTAGGTGGGTAGA 

AAAGG77AAAAG7AAA7AA7GA7GGGAAGGAAGACAAAAAGACGACAGGG 

G7GCCAAAGGACTCTTAACCTCATCTGAACGGAGTTGCCCTGTTTTGCTC 

7C7GA7GC7CA7G7A7C7A7CC77AGAGACAGC77GGCGGGCAA7G7AGA 

GCG7AGGGGC7GACA7AGGGGG77GGAG7CCCACC7CCG7GAC77C7AGC 

AAA77AGCAAAC77TGC7GC7GC7AAGCC7A7AAGGCGGACAGAAA7GCC 

A7C777AAAGC77G77A7G7AAAG7GCC7AGGACC7CG7AGGCATCAACA 

GGAA7AA7GGA7GAAACAAAACAACGG7GCG7A7C77GGAGAAAG7GGCA 

TCTGAGCAGGAGTATTTTGAAAGGTAGGAAAGGGCTCCAAGCACATCTAA 

GAGA7TAGGGAACGCAGAAGCCTTAGCCCTGGGTGCAGATTTAACCAATC 

AAC77C7AACCACCGCAGGC7GAGAGG7G7GGAG7GAGAGCCCCGCCAGA 

GGCAGGAGACCCGGGCTTCGGCCAGACCCCGCCTCCTGGTACAGAGGACC 

ACGCCCGGCTCTGCCTGGAGCCAAATGTGGATCAAAACAGCGCGCAGCTT 

CCCACTGCTGGTGAAAACCCGAGCAAGGGGCCTCAGTTTCTTTATCCGGA 

ACGTGGTGACAATGACATCTCTTTGCAAGGCTGCTGCAGGGCTTTCTGGA 

AATACGCCCGTGAGGTATCTGGGCCTGCGCACAGCCTCCCCCGCCCAGGA 

CCCAGACGTCTACCTGGGGGTCCCGTCTGCGCTCCCGGGATGGAAAACGC 

CCAGGGGAAACTTAGGCAGGCGAGCGGACGGGCACCTCCCGCGGGACGAA 

CTCACTCGGTGGCCTCCTACTTCCCCGGCCGTGTTCCAACGCC7GAGAAT 

AACGGGAACAGCGGTCGTACTCACCGACAGCGGCAGCAGCGGTAGGCCCG 

GGCCCCACCATGACTCTTCAGTGACAGTTTTTCTTCAAACGCCGCSCCTG 

TAGCCAGGACCGGCGTGCCGCGCGTCCACGCGTCCTCATTGGCTCCTGCG 

GGTTTGAAACTCGCTAGTCGTCAGCACGGGAGGGCGGGACAACAGGCAAT 

AGGCTCTTTGCGGTTGGCTCTGGCCTTGAGAACCCGACCTTGGGGCCC7T 

7GA77GGAAGAACG7GCAGCGCACC7CGGCA77GAGGGCGGC77CC7CGG 

GGCGCGGCGCCGCCCGCC7C7GAG7GCGCC7G7GAG7GCGCC7CCGAG7G 

GGCG7GGGACCC7CCG7GGGGGCC7CAGCCGGGC7GG7GG77GGGGGGCG 

G77ACGC7GAA7CCAGC7GGGG77GGCGCGCCGGGAG7CCC7GGGCGGAG 

AGACAGGGCGG7CC7CCCAGGA7GC7GGGGCCGC7ACC7GA77C7GTCC7 

77CAAAG7C7CAGACTCACAGGAGC7G7GAAAAAA7AA7A77A7AAAGAG 

GACA7A7GGG7C77ATGCA7C7AAAGGC7CC7AG77C77AG7AC7GCAGG 

G7GGC7CG777AA77G7GG7AAAA7A7GCA7AACA7CACA7A7ACCA777 

7AACCA7777AAAG7G77AAA77777CAAAAA7G7GCAG777AG7GG7A7 

TAAG7ACCCTCACA77G7GGCACAGCCACCAC7AC7G7CC777CCAGAAC 

77777CA7C77CCCAAA7GAAACCC7G7ACCCG7CAC7AAC7CCGCAC7C 

C7CCC7CCCCCAGCCCCAGGCAA7CACCA77C7AG777C7G7C7C7ATGG 

ATTTGACAACTGTAGGTGCCATATAAGTAGAATCATGCAGTATTTGTTCT 

37GAC7GGC77G777CAC77AGCA7AAAG7A77CAAGG77CA7CCA7G7G 

7AGCA7G7G7CAGAA77TCC777CC7777AAGGGGGAA7AGCA777CG77 

G7G7GGAGA7GCCACATTT7GC77C77GG7CCA7CCC7C7CCGGACAC77 

GAG77GC77CCAC7TTTTGGC7ATTG7GAA7AA7AA7A7GAACA7GAA7G 

CACAAA7AAC7C777GAGAC7C7CC7777CA77C7777GGG7A7A7ACCA 

CGAAG7GG7A77G7TGGATCAAACGGCAA7TC7A77777AA77TT77GAG 

AAAC7GCC77AC7CCTCTCACGG7GA7C7C77G77CAAGG7A7A777TCG 

A777CACC7GA7CAGC7GAC7A7AAGGCCATAAGGC7AACGGAGAAACGC 

AGGCC7AG777CTCCTAGTTACTAGGAGA7CGCAGGCC7CG77G7CC7GA 

A7CCC7AGACACAC77CA77CCCC77G7777AA7CC7AAA77777777C7 

777GAAG777G7CCTGTTTCA7C7A77C7CCAG777C77AAAGAGG7C7G 

GAAAA7GC7777GGCTCCTTG7G7A7GAAGG77CC7C77CCA7GGA7GC7 

GGAGAAG7CG7G7GTGGAGGGGCAG7CA7A7C7GGGCACC7G77GGCCAG 

G77CAGC77ACCAGTTGGG7AC7CAGCAGGGCA7GAAGCCAC7GCAGCAG 

CCC77C7C777AGCCGTAAA7AGGGAG777GGAAGAGAGCCAGGG777C7 

GGA777A7GCA77TTGA7A77TTCAA7AG7G7A77AAA7G777AAAA7AG 

3AAAAC7GA7CA77A77777G77AA7GAC7GAGAAAGGGAC7CC77CACC 

AACAG777CAGAAAAG7GAAGGCGG7777G7777GG7C777G7AGAA7C7 

AGG7GG77GAA7GCATG7CAG77G7AGAAG7CACC77GCC7GA7A7CCCA 

C G C AG7G C7GG AG7 A77C CACAG AC C C CA7G 7 AGG7 AC7G CAC C777GCA 

GG7A7AC7GC7GG7G7TGG7GAGC7GCC77ACC7G7CC7G77A77GGAGA 

7CCC7GC77A77AGGAAAC77AAAA7GAAC7CAAA7GAGC77CC77GC77 

AC7GG7C77AGTCCTTTGGAGCAACATAGGCCAGTTCTGCCTCGTTTT77 
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7CCA7CC777GGG7A77TGACGG7C7A7777G7AGGACACAAAA7G7GGG 

AAAA7AGC7AGGCAGG777AAAAA77C7CAAC7C7ACCAAGCA7GG7GGC 

77A7G7C7G7AATCAA7CCCAGCAC777G7GAAGC7GAGGCAAGAGGA77 

GC77GAGCC7AGGAG777GAGACCAGAC7GGGCAACA7AGCAAGACC7CG 

777C77AAAAAAAAAAAAAAAA77ACAAAAA77AACCAGGCA7GG7GGCA 

CACACCTGTAGTCCCTTCTACTCAGGAGGCTGAGGTGGGAGGATCACTTG 

AQCCCAAAAGTTGAAGGATGCAGTGCACTGTGGTCATGCCACCGCACTCC 

AGCA7GGGAGGCAGAGCAAGACCC7G7C7CCAAA7AAA7ACA7AAA77AA 

A77C77AAC7CA77CA7CAAAG7A7CCAC7G7AGC777CCA7CA7CC7GG 

7G77G77777777AGAAGGA7C7GGC7CCA7TGCCCGGC7AGAG7GCAG7 

GG CATGATCTCAGCTCACTGCAGCCCCCACCTCTCTGGCTTAAGCGATCA 

CCCACTTCAGTCACCCATCTGGGTAATTTTTGTATTTTTTGTAGAGATGG 

GGTTTTGCCATGTTGCCCCAGGTTGGTCTTGAACTCCTGGCTCAAGCGAT 

CCATCTGCCTCCATCTCCTAAAGTGTTGGGATTACAGGTGTGAGCCACCA 

CACCAGGACAATCCTGGTGGCTTTTAACGGTTTTCCATTGCTCTCAGGCT 

AATGACCTATAAGCCCCTGCGGGCTTGGCCTTTTACTCCCTGAGCATTAG 

CCACCTCCCTTAGCCTTAGCCCACACTACTCTCCCCTTGCTCAGTGTTAT 

CCAGACACTTTGTTTTTTCCTTTCCATACTCCTCTCTGTCTGGGAATCCA 

ACCTTTCTTTCTCATTTCTCTAGTTGATTATTAT7ATTTTTACTCTAGCA 

GCC77A77GAGATA7TTACA7ACCG7ACGA77C7CCCAC77ACAG7GTAC 

AA77 C AA7777 C7AACA7777CA7C ACCC C C7AAAGAAAC C C7A7AC7CA 

77AGCAG7CAC7CCCCA77C7CCCC7CC7C7CAGCCCC7AGAAACCA7GA 

A7C7AC7A7CCA7C7C7A7AGA777GCC77C7GGACA777CA7A7G7A7G 

AAA77A7GCAA777G7GG7C7C7GA7GGGC77C7777G77ACCAAAA7A7 

CA7GGG777GA7C7AGG7CC7GC7GC7CGC7GCACAGAAAGCCAGCCAC7 

GAGA7GACAAG7A77GCCAAGGAAGAAGGC777AG7CAGG7GC7GCAGC7 

GAGGAGA7GGGGGC7CAA7C7CAAA7CCA7C7CGC7GACC7AAAACCAGG 

GG777GGA7AGCAGGGAAGAAA7G7AACAA7GCG7AAGAAAACAGGAACC 

AGGGAGGGGCAAGGAAGCAA7CC7GA7GAA7GAG7GG7CCAAAGTC7CA7 

TGCC7GGA7G7GG7GA7C7GGCGAG777CAG77C777GA7AC77TTT7TG 

AGAGGCC7GAAG7C777TCCCCAGGAAGGAAC7CAAACAAAACAAA7ACA 

AGC77CCAGC777AAGACCAGAAGCG7CAA777C7A7G777A7CCGAAAG 

AACAG7C7A7GGGAC7ATTGG77AAG777CAC777CAC77AG7ATGC7G7 

7T7CAAGG777A7CCACA7AGCA7G7G7CAG7AC77CA77C777TA7GAC 

7GGG7A77C7A77G7GCGGA7A7ACAA7A7777A777GCCA77CA7CAG7 

7GA7GGACA7C7AGG77C7T7CCAC77777GGC7A77A7GAA7AA7GC7G 

77A7GAAC777CA7G7A7AAG77777G7G7AGACA7A7G7777CAACAC7 

CA7GGG7A7A7ACC7AA7GAGAGGAA77AC7G7G7CA7ACGA7AA77C7A 

7C777AACCA77TGAGGAAC7GCCAGAC7G7777CCAAAGCAGC7GCAGC 

A7777ACA77CC7ACCAGCAG7G7A7GAAAG77CCAGT77C777ACA7CC 

7CAACAACAC77G77A77G7CCA7C77TTAAA77ACAACCA7CC7AG7GG 

77G7GAAA7GG7A7CACA7TGTGG777T7A777G7A7T7CC77GATGAC7 

AA7GA7G77AAGCA7C77777A7G7G777AC7GGCCA777G7A7A7C7C7 

A77CAGAG7C777GCCAATTTTTAAA77GGG7CAG77G7C7TC7TCC7TT 

777777GAGA7GGAGCC7CACTC7G777CCCAGC7GGAA7ACAG7GG7G7 

GA7C7CAGC7CACTGCAAC7TCCACC7CC7G7G7TCAAG7GA77C7GGTG 

CC7CAGCC7CCCAAG7AGC7GGGA77ACACGCACC7GCCACCA77CCCAG 

C7AA7777777C777G7A7777GAG7AGAGACGGGG777CACCA7G77GG 

CCAGGC7AG7C7C77TG77GAC7C77AACCA7CC77CAG7C7CAGACAAA 

ACA7CCC777C7CAAGGA7TG7GA77AGC77GA77A7TTGC77A7C77TC 

7CCCTGCTAGTCTGTAAACTGAGGGTAGGCCACTATATTCATTGTTCTTG 

GCACCAAA7AGAAAC7AAA7TAA7G7C7777GAA7GAA7AGGGC7T7C7C 

C7777AAAGA7CCC77CAA7ACAG7AACCACAC7A7A7A7AAG7AGCCAC 

AAGCCCA77CAA7AA7AC7AC7AG7NC77GCGCCAAACC 
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GGC7CAGCG77AC7A7AC7GG7C7CAAAC7CC7GGGC7CAAGCGA7C7GC 
CCCCC7CGGC77CCCAAAG7G77GGGA77A7AGGCG7GAGCCACGG7GCC 
7GGCC7CAAA7AAC7A777AAG7GAAACAAAAC7AG7A7GGCAC7AA7GA 
AAAA7G7A7AAA7CCATAATCGCAGAGGGATTTCAACT7ACTTCT7TCGA 
7TA7G7AAAGG7CAAACAGACAAAAGACAATGACAAAACTTAATGCAATG 
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AACACTTTTGATTTAATGAACATATATTGGATATGTACCC^GAATTAGA 

3AATACATACTAG7TTTGAGTTTATGCAGAACATTTACAAAAATTTAGTG 

GAAGCCTAAATTATAAAAAGTTGCTGTCACGTAGAATAACACACAAACCC 

CTGAGTCCGGAATTCAAAGCCCTCCACACTCTCCTCTACCTTTGCATCTT 

TATCCTCCACCACACTGCAGTGCATACTCTGGGCTACTACTCACTGTTCT 

TGATTCAAATTCCATGTTCTGTCAGCTCAAATCATTCTCTCTGCCTGGAA 

TA^CTACTTCATACATATTCTGCTATTGAATTCTTGTCTTAGCACCCCAT 

CTACTCCAAGACGATGTCCAGTTGGGGTTACTCCCTGTCCCATTTTCTTT 

GATTACACTTTTTTTTTCTACTTCCATTATATTATTGATCACATCTGTGC 

CACAGTTTTTGACTTTGTGTCTGCTTTTACTCTTTTCTAGACCCTGATAG 

CTCCTGAAGGGTTGGGTCATTTCTTTTTTATTTGCTCATTCCTCATGGCA 

CAGTGAGTGCTTAATAAATGGCTATTGACTGAAATTAAACTGTATCTAAA 

TGGACATATTCCACTTCTGGGCCATTCATTCTTTCTTTCTATTGGAACCA 

GGAGATGGGGAACCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACAT 

GGAACCTTCCCCTGAGGGCCAAAAAAGAGCAGGGAAAGGTGCAAAGACAA 

AATCTTCCATTTTTAAACAATGTAAGAATGTGGTCCACCTCATGCTCAGG 

TGGGACTTTATCATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTA 

CCCCATATACATTTACCTTTAGTGTAGGGAACTGAGGACAGGAATTTTGT 

TGATGCAGACTCTTGCTAATGAGGCTAACACTTGGAGAATTTTTATCATG 

CATTCAAGAAGCTTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGT 

TTAGCTTTAGTTGTAGGCTTATCAGATATTTGGAGATATCTTCATAAACG 

ATGGCTTTGGTTTTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATA 

ATCAAACAGCATGGCCATTTGTTTTGTAAGGCCTTTCCTAGAATATGACG 

GTAAAATCTACGTGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGT 

GAATCTAGTTTGTCTTCAAAATGAAATCAAGTGATTAAAATG7AGTTTTC 

TAAGAAGATAAATGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATC 

ACTCATCCCTCATAAAACTGTCCCAACTGATCCTGACTCACATGAATGAA 

TTAAAAT AAG AGTT AAT AACAT CAATTT ACATTTTT AAAGACAC TTT C C C 

ATGTTTTAGACTATTGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAG 

AGTTGGCTGTTTTTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCT 

AATTTTGTTGCAGAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAG 

TGTGGAGAAGTACGGAGCCATTTACTGAAAGGCTGGGGGGAAATGACGAG 

ACCCTGAGATAAGGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTT 

GAGATATGTTCAGAGTAGAATCAGAATGGACATAGTGAACAACTGGATGC 

AGGTGGGGGCTGAGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGT 

TGATC CACTGAAGTTACATTATTCAACAC CACAAGGAAACTAGGGGAATG 

AGAAGGCATACTGGTTTGCTTTGGAGTGGAAGGGCAGTGATGTAAGAGGA 

3TTAATGAGTTAAAGTTTGGATATGCCTGAACTTCAATTTGATATGTGCA 

TCTGATATACCCTTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTAT 

TTCTTAGTAACTGATAGGCATCACAGACTCACATCAGTAAGGAAGCAACA 

GCAAACTTGATTGGACGATATACCTGGAACTCAGTACCCTATGACTGGAG 

CAAGTCTCTGTCAGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAA 

TATGTTGTTAGGAATATATGTGATGAACAACATAGGATACTTCCTACAGG 

GCTCCACATGTAGTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTG 

TAATTTATTTCCAAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAA 

TTAATAACAAATAGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCA 

CTTTGGAAGGCCAAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACC 

AGCCTGGGTGACACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAAT 

TAACCAGATGTGGTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTG 

AGGTGGGAGGATTGCTTGAGCCCATGAGGTTGAGGCTGCAGTGAGCCATA 

ATTGTGCCACCACACTCCAGACTGGGTGACAGAGTGAGACCCTATCTCAA 

ATAAATAAATAAATAAATAAATAAATAAGTACAAACCAGCAAACACTAAT 

CCTTTCTAGAGATTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGC 

AGAGGGACCTATGGAGATCAGCCTGGCCCTGGACAGCACCAGGCAATGGG 

GTTGCTAGAGAGGTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCA 

AGAATCCGTGAAGACTCAGACTAATTTTTTTTTTTTTGCATGAGGATTAG 

GTGTTCCTAGGAATTTCAATGAGAGCAGGGTTAATGAAGGAATGCAGGGT 

AGGAGAGCTGAGGGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGC 

3TCAGTATGGCTCACCTGCTTTCCTTGTATCTACTTAGCAGATGATCZCA 

ZCCCAGGCCTCCAGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGA 
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GGGC C7GGAGCAG7G7A^^AAGACAGAG7C77AAGAAA77GCA^^C. . 

GTCATGGTGCTTGGCAAGTGTCGTCATCCTATGCCAAGCCTGATCTGAAG 

GGG7GCA7GC7CA7AGG7AGC7GC7GCCCAAGA77ACAGCAGC77C77CA 

A7CCCAGA7CCA7GC7C7CC7A7A77CA77777CCAGGGG77CC7G7CC7 

7CGACAG7GA7GAGA7GCAGAA7GAC77A77GAG77A77C7CC7GA7AG7 

TGCCAACTTTTCCAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATG 

AGGCCC7AGGGA7AGCG7GC77AGGAAAACAC7CCCAGCC7GA7G7AA77 

C7GGGGG7ACAA7GGCA7777CA7CA7CAAGAC7GA7G7AAAGGG7GAC7 

AGCAGTGAGTTGGGGGTGACTCGCACTGGGGCTAGG7TTCTGATTCTGCC 

7AA7CCAGACAGAGCAGAAGCAC7AG7GGGC7GG7AGAGGGCC7CCAGGG 

CC7CAC77AA7G7CC7GGAAAAACAGC7CCAGA77G77GG77CACG77C7 

GAGGACAAGCTTGGGTACTACAGGATAGAGAGAGTGGTGGGAGATGCCGT 

GGCCTGCCCTGCTGATGCCTGCCCTGCCATTCCTGCGTGTGATG7CTCTG 

GGGCATCTTGCCTTCCCTGCCCAGACCTGTAGTTCAGCTGAGGGCATG7G 

GAGGCCAAA7GGC77C77AGAG7G77AC777CC77GAACAGC7C7GC7GG 

GAGAAC7GGAGGAGC7AGC7AG7CACGG7AAC7GCAGCAG7CAAAGGA7C 

G7CCCGG7GGAGG7GGGG7GGAAAGG7AGAGAAAGAGAACA7A7AGCG77 

T7CC77GGAGA7G7G7GGGCA7G7CA7AGAGGAAA7ACCCAA77CC7GAG 

CC77GAGCCC7CCAGGAAACC77GGAA7A77AGG77AG7CA7CCCCAAGG 

AAG7C7AAGAA77CTGG7CTCACCCA7CTCC777AA77CCCACAA7GATC 

CTACA7GATAT7AAGGAACACGGGCCAGTAACCC7CCAAGCAATGGATG7 

GGTGG7GAAGT7TGACCTCATGATGGAGCGGAGGTTGGTTTGAAACCTAA 

GAA777AA777A77G777CAAAC7G77C7CCAC7CAGCG77A77AAAGCA 

TACA7AA77GACACA7AAAAA77G7A7A7G7C7ACGG7G7ACAA7G7GA7 

G777CGA7C7A7G7A7ACA77G7GAAA7GA77ACAACAAGC7AAA7AACA 

7ACCCA77CA7CG7G777CAAAGGAA77AAAC7CAAGCACAAAAGAGAGG 

TGC7G77GAAGAG7AGGGC7GC7C7A7C7AAG7AG7A7G7C7GGGG77G7 

C C7GG A7 C AGGG7CC7777G7GC7AG7AA7 AAAC CAGCCC77C7GGGGC7 

GC7CCAC777CCCCACA7777C77C7GGAGCC7CCC7AAGAA77AGGACA 

7GGCCAC777C7C7GCA7AGGC77CC7AC77CAACAAGGACAGGGC77G7 

GC7GCCCCA7GCCAC77GAG7G7CCC7ACAGCACAGAGC7GAG7GCACAC 

7GGC7GAG7GAGGAAA7CCCCCAGA77AA7C77GG77C7AAGCA7CA7GG 

CTG7A777CACACG7A7A7GAA77ACAAA77ACAGCA7AG7CGAA7AAGG 

A77777G7GC7ACAAC7GGAA7CCCAGA77A7GCAAA77GGA7AG7A7AA 

7A77GAAA77CC7AGGAC77777A77AG7777AAAAAA77A7ACAAGC77 

\GAG7AAGAAA77AAACAG7GCAAAAGAA77CAC7G7GAAAAG7AAAA7G 

C7C7G7C7C7GC7GAGAGACAGA7A77GCAGCCCAGA7AC7AC7GGGG7C 

AA7AG7TT Z C777AAGCA7GCCA7777GA7GG777A7GGGAC77ACAGC7 

CAAGAAGC77GACAC7AGGG77GA7C7CAGAAAA7CA77G77GCAGG7A7 

TAGA7A7GACCG7C7CA7AAAGA7ACACACACAGACACAGCGA77GGAGA 

7A77CAC7GGGGC77A7GGGC7GC77G7CC777C7GC7C7G7GCC7AAG7 

7GGGC7CAGAG7AGCC7GGCA7CGGC7G7GGGGAGAA7GC7GGCA7GGGG 

77AGCAGGAGCCCAC77AACA7G7CC7AAGCCACC7GGAAGAG7CC77CA 

AGGAGACCAGAC7CCAGAGGCCC7AAGGAAGGAAGGAC77T7GCCCG777 

77AGG7A77C7AG7CCCAGAG7TTAGGGAGGAA7GG7TTGGC777GGG7C 

GTG7GC C C CT77ACCGAGTGGGA7GGGATGTGCCCA7GAGCTGTTGAGC7 

GGC7C77GGAGAAGACAGCAAAAGCGGGAA7AAGAGG7CAGGAAGC7G7G 

7GG77G7AGGAAA7CCCAGCAGAGGGCC7GGGGG7CAAAAG7GG7CA7GG 

7AG7GACGG7GGAGGC7GAGG7GG7AGAAAA7CAGAGGACAAACCCCA7G 

GGC7GC7GG7GA7C7GACCGAGC7CC7A7GC7C7CCTGG7TCA7777AGG 

C7C7G7AGCAGCAGA7GA77GGC7GG7G7GAGAGCAG7GCACC7GCCA7A 

7CAGGCAA7CCAAGACAAG7CCAAGC7ACGC7GGGAGGAAACC7GAAGGC 

AGCAGCAGG7AGAC7GGC7GAAGACAGACAGGCAGGCAAC77G7CAA7CA 

GA777G7G77777AAGGAC7777AAC7GGGGAGCCC7CCGGGACAGA7CA 

GATGAGAG7GAAA7G7GC7CCGCC77AGCC 
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GGCCG77CGCAA77C7G7AAAAGGGAGAG7GG7777A777A77777AAAC 
A7AG7CAAGC7GC7AAAG7A7A7GA7A7G7A7AGA7AGAG7A7AA77AAA 
7 AC777 G AAC7ACAG ACAAAA7 CAGGAGAA7GGAA77 AAAAAAC AA777 A 
G AAA7 G G G7 AA7 GGCAGCA77G GG77G C G C C CAC C CAC G AG AAGG C AG AC 
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ACCAAGATTCTAAGATC^JACGTGGCCAGCACTTCA^ 

77CG7GA77A7GCA77A77777C7CGGAAAG7777CAC77CAC7A7A7GC 

7AC77GAGAC77GC777CC7AAGACA7CCC7C7A77777GAGA7GAC7AA 

C7CAGCAA77CA777C7C7CACGCA7AAGC7G7CAC7CAACCCAAACCCA 

CCAAGCC7GCA77C7ACCC7CAA7AAGG7C77GG7G7G7AAAC7GACCCA 

CTTCACCTAGTTCCTTAGCCCTCTCTTGACCAGACATGACTCTTTCATAA 

GC7AGACC7A7AAAG7CAGGGC7C77AAG7AGC7GA7C7C7GATAG7GCC 

AA5TGTC C Z CCAC7G77CACA7777CCAC7CCAGC77C7AACAGG7GA7A 

GAC7GC77777GGGGG7AGGGGCACCAAAACA7A7AGACC7CA7G777GG 

A7G7AGACAC7CCAG77TCT77AAA77ACAAC7ACA7A77AA7AA7GAC7 

7CCAAG7G7ACA777CAG7CCAGA7C7C7CCC7GGA7CCCCAAAC777G7 

AAAACCCACCGCCTAGTTGATATCTTTTGATGTCTGACAGGCATTTCAAA 

777AA7AC7G7CACAAACAAAG77A77GA7777CA7CTC7GCA7C7G77A 

CAAA77777C77AC7TTGGTAAA7AGCACCCCAGGC7G7G7CAC7GCCAA 

GAACTTTCCACAGCTCTTGGAATAAAATTCAAAATATTTTCCAAGGCAGA 

AAGGCACAG7GTAATCTGGCTCCTGCCTACCTCTCCAACCTCGTATCACA 

CTAGTCTCCCTGTCACTCACCCCCTCCAGGAGCTCAGGTATCCTTAAAGT 

TTCTTTTCTTTTTTTTTTTTTTTTTTTTTTTGAAACAGTTTTGCTCTGTT 

GCCCAGGCTGGAGTGAAGTGGCATGATCTCAGGTCACTGCAACCTCCGCC 

7CCTGGG77CAAG7GATTCTTG7GCC7CAGCC7CCCAAGTAGCTGCAAT7 

ACAGGCGCG7GCCACCACACCCGGC7AA77777G7A77777AG7AGAGA7 

GGGGTTTCACAATGTTGGCTAAACCGGTCTCAAACTCCTGACCTCAAGTG 

ATCTGACCAC7TCAGCCTCCCAAGGTGCTGGGATTACAGGCGTGAACCAT 

7G7ACC27GCC7CC77GAAG777C77GA7CCAGAC7CA77CC7GCC77AA 

GG7C77GCA7C77CAG7CC7CCCC7CAAA7GACACC7CCA7GAAGACGCA 

A77ACC7G7AA77ACCG7G7C77A777AG7CAA7G7G77GG7777C7G7C 

7CC7CCAC7ACAG7G7AAGC7C7A7GAAGGCAGAAACC77GGCAG7CCAG 

77CCCAGCACAG7GCC7AGCACACA7AGG7A777AA7AACACACAG7AAA 

A77CACC7777AG7G7GCAA77C7GAG7777GACAAA7GCA7CAAG7CA7 

77AAG7C7GAC7A77A7CAAGC7A7AAGA7GG77GCAACAC7A7CAC7AA 

77CCC7CA7GC7CC77GG7AG7CAG7C7CACCCC7AACGCCCCCC7CC7G 

GCAA7CAC7GA7CCG777777G7C777A7AG7777GG77777CCAGAA7G 

CCAA7AAC7AAG7777GAA7GAA7GAA7GC7A77AAC7C7CA777C7GAC 

7CCAGAGCAACA7CCA7GCAA7A777A77A777CAGCCCCAAA7AC7GCC 

CCC7CACC77CAC7CCAACCACC7AC77GA7GA7ACAAGG7GAGACA777 

GGCA7G7GC77CC7CCA7G77CC7AGCA7777CCC7A7C7CC77AGCCT7 

CC77C7AA7CA7AAACGAAGAG7GAAC777CCC777CTAAAGGCAAC7TA 

C7CC7AGGACC7CGA7GCCA7AA7777G777C7C7AG7AC777C7A7A7A 

7ACACCAAACAA77AGC7CCAGAAAGG7AAAGAC7CAC7G7G7GC7CA7C 

AC7G7G7C7CC7AGCGCC7GGCACAC7GCAGG7GC7GAAGAAACACC7AC 

AGAA7GAG7GAA7GAA7C7C7CCC7C7C7AGAC7CC7TC7C7777G7AA7 

CAAACA7G77CAACC7GCAACACAG7C77A7GACCAA7CC7C7G77G7C7 

GACC7AGGC7GAGCTCCAGGGC7GGGACCC7GAC77CC77A77CACCACC 

7CAAGG7C7C7GCAC7CAC77C7C777C7GC7CAGGA77G77777C77C7 

7G7CACCAG7C77TTCTCAGACTTAGG7C7CAGC7CAGACA77GC7G7TG 

AAAG7AC77C7AC7GATCCTTTTA7C7AAAGCAGCCATTCCAGCCC7AC7 

C7C77GA7CA7AGCACCCTGAA77AAG77G777AC77AC7G7C7C77CAG 

GAGGGCAAGGAGC77GG7GG7GG7G77CAGGGC7G7ACCAAGC7G7ACC7 

7GC77CACCC7GC7ACAC77777AGCAACCA7C7AA7777ACA7GC7CCC 

77CAC7CG7CAGAAA777CC77A7777C7AC77CAAGCAGG7A7ACA7A7 

G7GC77C7CC7GGGAGGC7CACCCAC77CA7GAGAC7ACA777GG7CC7G 

GG7AGAAAG7G7ACAAAA7CCAC7GGC7CAG7777AA7CAA7G7A7G77A 

A7A77 AAC CAAC C7GAGA7C77GA777C CACG C C7GGC7AA7777G7A77 

777AG7AAAAACAGGG777C7CCA7G77GG7CAGGC7GG7C7CGAAC7CC 

CGACC7CAGG7GA7CCGC7CACC7CGGCC7CCCAAAG7GC7GGGAC7ACA 

GGCA7GAGCCAGCG7GCCCGGCC7AAGA7C77GA777C7ACCA7C7GAAC 

7C7G7A777GAAC7GAC7GC7CC7GC77GAGC77AC7GGCCAAAAC77GG 

CCCAC7CAGAC7CACGGAAG777C7GG77C77CCC7GG7AAC7777C7GA 

AC77AACCAC7GG777GC77GACAAGAGA77ACCA7C77C7CAC77CC7A 

GC7A7G7GAACTCACTTATCTGCTCTATTGCTGTTCAGTCTAGCACGGCA 
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C77A77GAACGAG7G7C7ACA7C7GCACCCCC7AC77C77AC7CA7CCA7 

7C7G777CAA777C77AAAAAGAAAAAAAAAAAGC7A77G7AAACA7ACG 

ATTACAGAAAATGATTTATAACATGTGTATGTACCACCTAGCCCTGTCAA 

G7C77AA7A777G77A7A777GC77CAAA7C777777CAGAC7G7AG77A 

AAAATTACTTAGGAGCCATTATTTATGGCCTATTTCCTGACCTAGTCTTC 

TTGATGGTCAATTTGCCTAATCATCTTAAGTTGCAAAAGCTTAGAATTAA 

AGgAAAGTACCTTCGATCCTCTGCTGTTGCCTTCTTTTTAATATTTGGGT 

T7G777GGG7CCCA777ACGG77G7GACA7CAGC77GAG7777GGGAGC7 

G7C77G77CAGAAAA7GG77C7GGGGAACAGCC77777CAAC77GGAG7C 

CAAAGTCTGTGCTTTTTGCTGAAAGCCATTATTGTTATGTTTATTACCAC 

TGGTTCCATTTGGTCTTATGCTAGGGGTGCTTGGAATGGCTGAATTAAAT 

CTGCCAACTGTCAAATTAGGCCTCTGGCTTACGGCTTTTGACTTTTGCAG 

TACACATGATGTCTGAGGTATACAAACTTGGCTGGACTTCTGATCTTGC7 

TGATGTTTGGATGTCTGTTGTTATATTCACCCTGAAGCAAACTGGGGTAT 

GTTCTGGGTTTGGTGTGCTTCACTCTCTGTTCAG7AACAGGGTATGACCG 

TATCTTAGTTTCATTTGGTCTTTCATATTGACTCCTATTAACCTTTATAT 

CTTTGATGTTCTTGACTACTGGTTTCTTTGATGACTGAACTTTACTAAGG 

GTCCGAATAAAGTGAGAGGGAACCGTCCTTGAGGGTTTTACTCCTGGTCT 

TGCAAGATCTGCTCCTCTAGAGAGTTGCTGTGATTTTACTGGGAAAGTCC 

TGCTTTGTGTTTCTCCAACAAATTGTTTATTAACCCTATCTTTCAGAACA 

GCACTATTAACTGAACTTTTGCCCAAGGCTTGTTTAGGAACTAAACTGTT 

C77GG777GA77A7AAGAG7CAG7C777GGC77AC77C7GG7A7A7AA77 

7AGGA7C7GGC77CC7C7CAGG77C7G77AAGA7A7C7AGCAAG77C7C7 

T7G777G777C7777AGAAAG77A7CCAAAGA77CG7777CAACA7GGA7 

A77A77CA7AAAG7C7A7ACA777ACCA777CC77GA7C7G77AAC7GC7 

GCTTTGTAGTTTTCAATTGCTCTATATTAAGTGACCCCACAGGTTTTCTT 

GACAGTCCTCCTGTGGTGGACTATCTAGCTTCACACTGTTGAAAACTCTT 

GCTGAAAAGCTTAGACTATGGGTTAGAAGAAACACATTTTGAAGTCCGCC 

TTTTGCCCAGAAGTTTTGGTGGCTCTAACTTCAGCTTCTGGGACCCTGCA 

GTATTAGGTGGTCTGGGCTGGAGTTTAATGCTGATGGACCTTTTAGGTTT 

GACAGGCAAAACAACATGGTTGGTAACATCATTTTTGGGTCTAATAGTCT 

GAAAAAACAAAGAAAATACATATTAAAAAATCCTTAACATATCTTATTGT 

TTTTAAAATAATAACTGTGTTTAACACATGCTAAAAAAAAAATCATTTTT 

AGAATTTCATCTAAGAAAGTTGAATCCTCAGAAAGTAAAGAAAGACTCAC 

TTGCTCTG7CACCCAGTCTGGTGTGCAGTGATGCAATCTTGGCTCATTGC 

AACC7C7GCC7CC7GGG77GAAGCAA77C7CCCACCCCAACC7CGCAAG7 

GGC7GGAC7ACAGGCGCA7G7CAC7ACACC7GGC7AC777777G7A7777 

7AG7AAAG77GGGG777CACCA7A77GGCCAGG77GG7C77GAAA7CC7G 

ACC7CCAG7GA7CCACGCACC77GGCC7CCCAAAG7GC7GGGA7AACAGG 

T A7G AG C C AC CACAC C7G7 C C7AACAGG7 AG77777 ACAAC77G AG77 C C 

7A7CAGAAG7A7AT7AGAA7C7777AGC77GACAGAA77AAGCAGAGA7G 

CAG7GAA7A7ACAAAAC77GC7C777CAAAAA7GAA777GCC7CAAACAG 

7AG77G77GAA7GCC7A77ATA7CC7AAG7GCCC7CCAAAGAACCC7GAA 

AAAA7ACA7ACA7AATGAAC7TA7G77AGGGTACC7CCCAACAAA7C7C7 

CC7AG7AC777G7A7AGCCACAC7A7A7G777777AAACCAC7GCC77TG 

7AAACA7CACAG7A7CAC7CAAGAACC7C7G7C7CA7CCCTGGAGA7CAG 

7GACAAGGAGA7AGG7GGCAGA7GA7G7GAGGCC7GAGA7A7GC7GCCAC 

AGC7C7CAA7AAACA7G7AACA7C77AA7AG7CA7A777G7AAAA7CAGC 

CAGGACAGGG7777AAGG77AGAG7C7A7G77AA7AA7AAACAAA7G777 

AG7CA7G7GA777AAG777GGA7AAGAAAGG7AGGAC7CGA77ACAGAGA 

A7777GAAAAC7AGGGAAGGGAG777AGAA77CA7A7GG7AAG7AA77GG 

GCAAGCCAC7A7GAA77CC7GAGCA7C7C7CA7GAAAGCAA77AC7CAGA 

AAGGAGAA777CACAGAGA777A7GGAA7A7G777CCAGGG7AAGA7A7G 

GGAA7GC7AGAG7TACCAC7C7A77777GA777GACAAA7A77G7GAAGA 

A7CAC7ACA7AAAC77GGCGAG7A7G7AAAGGA777C7AACCAGAACCA7 

77GGCA77GAGGGCAAAGAAA7G7C7AC7C7GGA7GA7AGCGG7G7G7G7 

GG7G77AC7AGGAG7GAAACAGCGGAG77GGGAG7GGGAGGCAGAGAGA7 

GGA7GG7A7ACCCACAA7GGC7A7A7C7GGA77AA7C777GAGCACCAAC 

A7TTA7A7ACACCTCGGATCTCTCCATCA77GC77AC7GAAGAGGTGGAG 
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GGACGTTGGCATGAAAGurrCCAAATGTGTTTTTTTAGTTGCTrteTlTAi-^ 

AT ATT AAAAACGAATTGATATAAT C CACAAAC CATAAAATTCACCATTTT 

AGTAAGTGCACACTTCTGTGGATTTTAGTATAGCCACACTATTATACAGC 

AA7CACCAC7G7C7AA7TCCAGAACA7A77CA7CACCCC7AGAAAGAGAC 

TTGGGTTTACTTGTTGGCAGTCCCTCCCCA 

>Concig38 

GG7CTACATG7GC7CGCAAGATTGGA7AT7GAAA7A7CAGCAAGAAAT7A 

AA7GACATAG7AG7CA7TA7GCC7AAA77A77G77A777777GA77GAAA 

AAAGTTGAATATTTCAAATATCAAGGTAGTAGTGAGATATAATAAAGAGA 

GAGTCAGTTCTAAGTATAGAATTGCTGATTCAGTTAAGCTCTGTTCTCCA 

ACATTTGGGCCACATTGAAGAGACCATGTAGCTGCTTTCAGCCTCGGTTT 

CC7CCTTTGCAAAA7GGGGA7TACAC7ACC7GCC7CACAGAGA7G7AAAC 

TTATGACATGTTATCATGATTGCCAGGGCCCACCTGTTTTCTTTTAAACA 

TTGAAATCACTGTGCCTGAAACAGGGATTTCCCTGCCCTTTGTGCAAGCT 

CCAGAAACAGGAGTCAGCCTGAGTCCCGCAGCTAAGAACGTGGATTCTGG 

TCATTTTCTCATAGCGAACACACTTCACAGGTCCTTCAAGGGAGTACATT 

TTCCTATAACTCACCTTAATCTCAGTTGAAGCCTCGTTTCTTATTTTGCA 

CTGTGGCCAAAAACTAAATCTCATTTCTTTCACGTAAACTTCAGCAATTC 

AA7AATAG7ACAG7CATTTTA7GT77CAAC7GAACCAAG7CAGGG7TCCA 

CTCCTGCCTCCCCTTTCTGCTCTGAGGACATCCATGAAGTGGAGGGGGTC 

TATGTAGCCTGGAGCTATTGGTGAGGGGCGATGGGTCCGTGGTGGT-CTTG 

GGGAACTGCGGGGCTGTGTCTGGCTGGTCTGGTGTCTGGTGATTGGCCTT 

G7TCCACGCGG7TCACGCTGCAGGACAGTTCG7G7CCTTC77G7CCTAAT 

GA7CAGCTTTTAGGCTCACGGGCC7G7C7CTGC7GAGA7A7GGAA7AGGA 

CAGCCTCTGGATC7TC7TTAAACTC7CC7GGGGCCACAGGGGAC7CTGTT 

TGTG7CTG7GCCCACA7AGGA7GAT7C7GCCCAGACC777GC7GCCA777 

CTTGC7G7TC7GC7G77T77AG7C7C7GGAGGGC77GCAG777CC77GGG 

GTCCC7GTGGAAGCAAAGCAAAGTCC7C7CCACGC7CAGA7G7C7AAACG 

7ATC7GGG7777A7CG7CCACCCA7CCCAGAGC7CAG7C7AGAGGAGGGG 

GCAGCCTTCGGG7TC7C7CCTTCC7CCCAGAGCC7C77CC777GCACCAG 

GGCAGCCTCTTCCTATCTGTTGGAAAGGGCTGTCTGG7TCTTGAATATAG 

AGTTGCAGG777GAGGGG7G7AGGC7GAGG7AAGGCAAAC7A7CACATGG 

AATAAAAA77ACCC7G7G7CAAGGAACAACCAGAGC7GGACAG7T777AA 

A7GTGAAAACCAA7T77ATTCAGGACTA7GGCGAGAGG7GAAG7AAGACC 

7CAG7A7AGAAC7GGGC7CAA7TCCGAA7GCAGCA7GGGCAAA7GGGAA7 

G7A7AGCC7AGGAGCAGGG7GGGAACC7G7GGA7GAAGAA77AC7AAAAG 

GGCA7ATCAGGGG7GAGGGGGCGTCCTGGCTACACCCAC7AAC7ACTG77 

GC7GAAGAAAGGCC7GG7GACA7CAC7GGGGAATGG7GGGGGA7GAAGAA 

TCCAA7CAGA7GGA7A77GAGGATAAGGGGA7C77GA7AAAC7GGCTTAG 

3AGGG7TT77GC7AAAAC7GGTTT7CA7AGG7AAG7CCACAGACAGG7CT 

TGGAGAAAG77CAGGGACCTACGG7T7G77CGGGCAGA7GC777G7CA7C 

7G7CACAC7GGCAC7GTCACCTGGC7TTCCTT7AG7CCC7CCCCCCC7T7 

7T7T77TC7GGAG7AG77TTGGGAGACCAGAGGAGCAGGGAG77AGGGAG 

AG7AG7CAGAAAAGGCCAGAGAAAA7AAGGAGGTG7C7G7AGGGAAAA7C 

C77AAAT C CT C7AA77AAATTAA7TTAA7T7 AT77A7 C7GGGACAAGG7 C 

7CAC7CTG77GCCCAGGCTGAAGTGCAG7GGTG7GA7C7CGGCTCAC7GC 

AGCC7CGACC7CAGGGC7CAAGCAG7777GCCACC7CAGCC7CC7GAG7A 

GC7GGGGCTCACAGG7G7GCACTACCA7GCCCGGG7AA7T777GGGTT77 

TT77T7TT777777T7TTTTTTTT7TTTG7AGAGA7GAGG777CGCCA7G 

TTGCCCAGGC77GG7CTCGAAC7CC7AAG7GA7CCA7CCACG7CGACC7C 

CCAAAG7GC7GAGA77ACAGGCA7GAGCCAC7G7GCCCGGCC7AAA77C7 

CCAA77TT7AAA7GC7TCCC7G77CCC7G7TCCAGA777GGGATA7TGAC 

7GC7G7TAAA7CAGCGA7TTC7CCC7G7GGAGAGG7AGCCAA7AGGAAGC 

AACAAGAG7GAGGAG7CCT7A7A7CGAAA7AGAGGG7AAGAGAAGAGACA 

GA7G7TATC77GGCAG7GA7TTAAGAACAGCGAG7C7G7AAGCAAAGCAA 

AGCAAGGCTCCCAGG7GCTGAGAAACAATGGCT7TC7GGGGAAGCG7C7G 

7G7TCAGAACC77AAG77GGAAACA7CTC7GAAGA7G777GCCA7GAAGG 

TTTTCTTCTGAAGT7GAG7C77TCATCAC7AGG7AGGCG7G7777GGAGT 

CTCTATCAAACAGA7CC7G7G77TA7TAGGAAGC7G7GG77CA7AAAGCC 

2CA7GCTAAT777GCAGG7AGCAGGG7GGCCC7GGCC7GACCCGGGGACA 
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GAG7GGC7G7CCTCCC1 _CAGGCAGGAAACTCTCTCCTGCCACCT AGT^-r : 

CTGCATACCCACATTTCAAGGGAGCTTCTGGGTGGTGAGTTTACCAGACT 

ATGG T C T G AGGT AG AGTT AAGCAAAAC AAAAC T AAACTG CAT AAAG AAAC 

AGAAAGAAAATCAGGTGTTATAAAAACAATTTGGCATTTGTTTG7GTTTC 

AGCTCCG7GTCGATTTATTGCTTCCACAAATAG7GCCGATATGCACCAGG 

CACTGTTGTAAAACTGAAAATATGTTTTTGGATGTGCCCAGTCTGTGAGT 

ATTAAACGATGGTTGATTTGAAATTTGCTATGATTCATATTTC7GGGGGT 

AAGATGCAGGATTTCTTTGGGGGGCCTACGATGTGGCATTCTAGAATTCT 

CAAAGAATCAACCCTGGTGGGACCAGGAAGAGCTGAGCTGAGGCCTCTCT 

GCTCATG7GTACTTACTGGAGATCATGGAGACAGGTGAGCCTGAGTGCAC 

G7C7CACCAAAGCCACAGCAGAGGGGGAGGAGGCGGAAAGAGAGC7C7C7 

CCA777CTGAGAAG77AA7GG7AACAA7GGCATACA7ACC7AC777ACAG 

T7GAAA77GGAAACCACAGCA77AAG7G77TCCAA7GAAA777GGCAA77 

TGGGAG7777C7GAGC7GCA7TGGA7GTGGT777GCA7GC7G7TAGGATG 

AGCAAGAGA7GA7GGAGAACA7C77CCTT7TGAGCTTCC7CT7GGACG7G 

GG7CAC7CCCAC7CA7GGAA77AGAAAGC7TAGACC7AGAC77GAA7C7C 

ACC77CTCAAGG7GC7CCCGGGCAAATCACT7AAGA7CCA7CT7C77C7C 

CTCC7GC7CCT7C7CC7CC77C7GAG7777T7T7777C7T7CCAAAATTC 

AAATGACACGGTACTGG7AGAAGAAAAGGTCCAAGTCTGC7T7TACAGC7 

CCCC7CATCCCCAAATGTACTCCGACCCCAAGATGACCATGT7A7CATTT 

GAT7GACA7CC7TCTAGT77CAAC7CATT7CTT7GCATG7ATA7GCACG7 

ACATA7ACAC7A7777AT7T7GCCAGGGGTCACCG7T7AGC7GCA77AAT 

77C77A7AAAATAATC7ATATTTAC77ATGG7TTACGTAAAACAACATAC 

ACATG7AAG7G7A7AGCTTGATAAG7CTTCACTGTAAACCAAAAA7AAAA 

77CGAAGCCCCCCCAACCG7C7GAA7GGACCCC7C77C77GGCCAAGAGC 

ATTCCAAAG77AACCTGAAAAAAC7AG77CAGG7CA7GA7GGAAGGGAAG 

GT7GGACA7GCCCCAGTA7ACCC77C7CCC7777GGAA7TCAGGAAAAGC 

TGACCAGCA77AACA7CAACACAGACC77ATG7C7GATAGGAAAC777GA 

CAATCTA77CCC7CTGAAGC77GC7ACCCGGAGGC77CA7C7ACAAGA7A 

AAACCT7GG7C7CCACAACCGC77ATCA7AACCCAGACAT7CC7T7CTG7 

7GAGAATAA7TTACCTTGTAACC7GGAAGCTCCC7GCTTCAAG7TCCC7C 

ACCT7TCCAGAT7GAACCAA7GTAAACC7TACA7GCAT7GA77GA7G7A7 

TATG7CTCCC7AAGA7GAATAAAAGCAAGC7G7A7GT7GAC7GCC77CAG 

CACAGG77G7CAGGACC7CC7GAGGC7GGG7CACGGATGCATCCT7AACC 

T7GGCAAAATAAAC7GTC7AGA7TGACTGAGACC7A7C7CAGA7AC7G77 

GGG77CAAA7A7ATAACT7A7GAAAC7AA7ACACAAATCAAG7CA7AGAA 

TATT7CCA7CAC7CC7CA7C7ACCCCCAAA7T7CC7TATGCG7C777GCA 

G7CAACC7CCCACCCCA7CCCCAGGCAAC7GCAGA7C7AC77777G7C7C 

7GCACC7TCAAC7GACCC7T7C7G7GA777CA7A7GAA7GGAA7CA7GCG 

C7GAGCAG7C7777G7GTC7GGCT7C7T77GC7CAGCA7AATG77777GA 

GG777G7CCA7G7TT77G7GT77G7CAA7GG77AA7T7CTC7CCA77GCA 

GAG7AG7777C7ATTG7ACA7G7GTACCACAATTTG7A7ATCCA77CCA7 

TGCTGA7GGACAT7TGAT77G777CCAGATTT7GGCAAT7A7GAA7AGAG 

C7ACCA7GAACACCCAGG7ACAAG7C7T7GTGTGGAC77ATG7T7TCA77 

TC7C77GGAA7GGAACTG7CATA7CAATAAGTA7ATG 7T7A AC 7T7 G7AA 

GAAAC7GACAACAAAT7ATC7GCGATGGTTATGCCATT7TG7T7TTCTAC 

CAGCAA7ACACGAGCAT77CAG7TGC7CCACAACTT7GCCAAAAC77G77 

7TCT77AA7TTGGACA7T7AAGTGGTGTACAGAGGCATCTCA77G7GG77 

CTAG7777C777GCCC7GATGACCAA7GGTGT7GAACA7CTT77CA7GTG 

CTTTT7G AC CATTTACATATC C7C7TTTGTGAAG7G7CTGTTCAAATA77 

7TTGCCCA777AAAACATTTGGGGG77TGTCT7A77A77G7G77GGGAGA 

GTTCCA7A777A7TTA7TTA77GAGA7GGAGTC7CACTC7G7TGCCCAGG 

CTAGAG7GCAG7GGCG7GATC77GGC7CAC7GCAACC7CCAC77CC7GGG 

TTCAAGCAA77C7CCTGCCTTAGCC7CC7GAG7AGC7GGGA7TACAGGCA 

TG7GCCACCACACTGGCTAAG77777GTA7TT77AG7AGAGATGGGG777 

CATCA7G77GGCCAGAC7GGTCGCAAA77CCTGACC7CAAGCAATCCACC 

TGCC7CGGCCC7ACAAAG7GC7GGGAT7ACAAGCATGAGCCAC7G7GCC7 

GGCCCATA777A777777A77C777AT777GTA7ACAAG77C77GGTCAG 

A7ACAATAA7ACC7GG7CAGA7GAGA7AATGAG77GGAAAA7GC777GCA 

AATGGGGGAGAATAA7TTAAATG77ATTTA7TTAT7AAGAGCAGAGGCCC 
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TTCCTGTTGCGGTCAO.^AAGCCGTTTGCTTCTTCTGCCTTTI^TA^wJ 

AGCAGAG7CGAGC7ACACAGGC7G7C7G7G77GGC7GC7A77AG77AA7C 

AGAGAG7777777777C77GCC77G7CA77C7AA777G7GACACA7AA77 

AGCCACAA7A7G7G7777CAG77G7GACAC7GGCC7GGGAAACCAAGGGA 

TG777AGAG7GGA777CC77GA7777GCAA7AA77G7G7G77777C7GCA 

TC77C7G77AAAGACAAA77CA7GGAAGCAAAACA7GGAAGCAAAG7ACC 

CTGGACATCCCCCCTTCTTTATGAAATTGAT7TC7CTTAAATG7AATGTT 

7SC77G77CCC77AC777AAAAGCAA777AAGAG777A77GAGAAAG7GA 

GCCC7GGAAACA7AGA7GCA7AGAGAGAAAA77C7ACCACCC7CAGG7CC 

C7A77G7C77C7C7CA7AAAG7G7AG777CAGGGCC7777AGAAG777C7 

T77C7GC7C7GA777GCA7G777G7GAG7G77GC7A7777AAG7A777GG 

A777GG7C7GCAAA7CC7A7GAGAGA7GGCAACAGAG7AGGGA7C7CAAA 

GCCTGCAGGTTGTATTAAGTCCAGCAGGGCCTTGTATTTACAACAGAGGG 

TCCTTGAAGACATTCCATATATTATGCTAGGGGAGTGGCCAAGCAAACTT 

TAATGTGTCCCT ATGG TGGGATATTTGGGGTTAATACCTGCCCTTCTCTT 

AA777C77777C7777C7777777C77777C777C77777777777GAAA 

7G7AG7C77GC777G7CACCCANGC7GGA77GGAG7GCAG7GG7A7GA7C 

7CAGC7CAC7GCAACC7CCACC7CC7GGG77CAAGCAA77C7CC7GCC7C 

AGCC7CCCAAG7AGCTGGGACTATAGGCACACACCACCATGCC7GGCTAG 

T7777777777777777GAAACNGAA7C7CGC7C7G7CGCCCAGGCGGGA 

C7GCGGAC7GCAG7GGCGCAA7C7CGG 
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7AAA7GG7GCC7TTGTCTCCCAGT7CAC7CCAGCCACCCA7CC7GGGGCT 

GCACCC7CC7CCTTCCAAGCCCTC7CCCT7TCC77CCTGGTGC7GCCTGT 

CATG7CAAGCATATGCATCAGTGCGACCAGGACATTTGAAATGCAACCAG 

7ACAA77GGGCGCGGTTATGCCTACCAGTTTTTCTTCCTTAAACA7TTTA 

7A777A7G777GAAAGCA7GCCACC777C77CAC77GCCAAC77GACAGA 

777A77AG77GACAACA7CCGC7GA7AGCA7CAG7AA7AAG77AA77G77 

7TTGCACA7G7AGC777AA77A77C7CA77A7CA777A7AGGAG77A77C 

777G7AAAGGG7AAC7GAG7777CCAAAACAAACAGAAA777GGGG7GGG 

C C CA7GG AGC G7GAC7CA7G AAA7 C AG A77C77AGAAGG AC C7CGGCAAG 

7C7C7GGG77GC7G77AA7GAGCC7GGC7GGC7GCCAGGGG7G7G7C7GC 

CC777A7GAGGCCACCAC7G77CAAA7GC77GCC7GCAGCA77AC77GCC 

7AGG7AG7GC77G777C7AC7GAAC7G7CAGGGA7CCAA77C777G7GG7 

C7AAG7AACAA7AC7CAGA77CACAAGGAA77GA77AA7AAGCCAGAA7G 

CCAA7G7A77ACA77777GA7GAAGACCA7A777ACAG7GA77G7A7C7G 

C7CAAGC7CAAA77AGGA77AGAG77C7GACAAA7ACA7A7G7GAGAAG7 

A7GAGG77AAA7AC77GAAA777GGAC7777C7AGAAAA7C7GAA7G7GA 

77GC C A77 C ACA7AC C777 C7GGGG A7G A7GA77 C77G7AC7777A7777 

AAAAGACA7AGAAAAC7AAC77AAGAA7CAGA77GC77GGC7GGGCACAG 

7GGC7CA7GCC7G7AA7GCCAGCAC777GGGAGGCCAAGG7GAG7GGA77 

GC77GAGC7CAGGAG777GAGA7CAGCC7GGGCAACA7GG7GAAA7CCCA 

7C7C7ACCAAAAA7ACAAAAAAAAAAAAAAAAACAACCAAAAAGAA7AAA 

77AGC7AGG7G7GA7GG7GCG7GC77G7AG77CCAGC7AC77GGGAGGA7 

GAGG7GGAAGAA77GC77GAGCCCAGGAGGTGGAGGTT7CAG7GAGC7GG 

GG77GCAACAG7G7AC7CCAGCC7GGGCGA7AGAG7GAGAC7CCG7C7CA 

AAAAAAAAAAAA7CAGA77GC777A77GC7GG7777C777C7AAAAC7GA 

GA77GGG7CCCA7CA7CCCC7GGCCCCCA77GG77AA7GG77CC7CC777 

G7C7A77GAA7AAAA7ACAGA7G7C7GC7777GGCAACA7GG77GAA7G7 

AGACAC7GCAGGG7C77CC7GAC7CAAAA7GAG7AAGGC77AGA7AAAAC 

ACA7777GAAA7GCA777C7GGA7GAACAGCAAGGAAAGGAGA7C7C77A 

AAA7CC7C777C7G77CCCC7C7CCC7ACCCCC7CCAAG7GGGC77AAG7 

AGGAAGGG7GG7GAGCGGCAGG7AAACACACG7CAAAGGCAG7C77CC7C 

7C7GAGGGAAAACAC77G7A7AAGCA77GCAA7CAA7GGGCC7C777AA7 

7A7G7GCCAG7GGCAAGAGCGGG7GC7GAACCCAGGGGCC7GCC7CAA7C 

CGGGGC C777GAGGCAGAA7AAAG7GG7C7CAGG77G77GGCA777CC77 

GCCC77CCACCCGAAGCAGACACAAA7CC7C7C7GGAGGCAAG77CCCCA 

ATTCAGCCAGTACAACTCCCACAGAC7AAGA7CAATCATGTACAAGC7CA 

CAGACAAAGG7CACCAAACACACAGAGCAA7AAACAAATTCATGAG7GAC 
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3TGAATGAGAATAAAC^^*AA.CAATAACCAC 

C77CAGCTG77AGAA77CC7GAA7A7AGAA7AAAAC7GCCACAA7GGCAA 

ACA7GCA7CTAG7AC77AC7G7G7GC7GGG77C7AAGAA7777GCACA77 

GTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCCTCTT 

AATTTGCACTAGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTGTTCT 

TGGCTGTGCCTCTGGCAGCACATGCAAAATGGGCAGTAACAGTGGCAGTC 

ACAGGTAAGTAGCCTTCTCACAGTGTGGAGTTAAAGGCATGGGACTGAGA 

CGAGCAAGG77CC7AAAGGGAGAG7GGCCAG7AGA7GACCAGGGGC7AC7 

GGAG7GGC7GCA7GGC7C7G7GGAAGC7CAGAGGAGCC77GGG7CC7GCA 

GG7GCAG7AGCAGC777C7G7AG77CC7GA7C7C7GGG7CCCACAA7C77 

CCCCGTTTTTGCTCCTCCACTTCTAATTTTG7AACTGACTTCCCTGTGTG 

TAC77CTC7C7C7GA77GAAA7AGCCAGAC7GG777C7G777CC7GA7AA 

GACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTCTATG 

AAGGAGGTACAATAATTATTCCTATTTTACAGATGAGGAAACACAGCAGA 

GAAATAAAGTCAATTGTCTAAGGTTGCACATTTAGTCAAGGGAAGGGTTG 

ATATAACATATAATTATTTAGAAAACATCTAAGGAAATAAAAGGCATAAT 

TTAAAAATAAAACTAGGCAGGTTTAAAAAAATGAAGTAATCTATAAGTAA 

AAAAGTATAATTGTTGAAATACATATCTTAGTGGATGGGTTAAATAGCTG 

AAGAAA7GA77AA7GAAC7GGAAGG7AG77C7GAGGAAA7CAGAA77CAG 

CATAGATAGAAAAAATGGGAATTTACAAAAGTACACAGGAATTATAAAAG 

AGGTTAAATTATAGGGAGGGTAGAATGAGAATTAACATTGGTCTAACTGG 

AATTTTGGAAGAAGAGAATAGAGAGAATGAACAAGGCAATATTTAAAGAG 

G7GGC7GAGAA77777CAGAACGAACACAAAC7A7GAC777ACCAG7AGA 

GAAAACAATGTACACTGAGGAGGATAAATAAATATACTATGAACAAATTG 

TAATAATAATACTCAACAAAGACAAAGAGAAGATGTTAAAATCAGCAAAA 

AAAGAAAGTCAGACTTAGAAAGAAATGACAATGGCAGACTACTCAACAAC 

AACAA7GGAA7CCAAA77CGG7CAAACAG7A7777C77CA7GC7AGCA7A 

TAGC 

>Contig4 0 

GGGAGTCCGCTATGCTCCTAAAGATTTGCACCTCTGATCTGGTTTGTAGT 

TAGTCTCTTTTATTGCTTTATCCTACTCAACTAATTTTTTTAGTGCCTGT 

TTTTTTTTTTTTTAATGTGTGTTGATGACTACAATTCTAAACTCATTCTA 

CTGATTCATGGGTGCTTTAAAATCTGAGCAGTCTTTCGCATTTACTGCC7 

G7GA7GGCCCA7CCCACCAGC7AAAG7G7G7GGCCAC7GC77ACAGCACC 

A7G7GA7AACGAG7AAGGGAGAGA7GCCGCCCAGAC7C77C7AGGAGCAG 

CCAG7AGGACC77CCAGGGG77GCAAGCAAACCACAGCAA7A7G7GGAG7 

G7GGCAGAGGA7GGCCCCAAGAGGA7G7GGCAGCGGC7AG7GCAGC7CAG 

C77AG7C7GAGAGGAAA7GC7GGAGAGGAGAGCCCAG7C7G7ACAGGCA7 

GACAGCCACAAGGAC77CAACAGC7AACA7GGC7GAG7GGAC777A7G7G 

C7A7C7CA77CAGAAAACAGGAGCAA7CAGAAAGGAG7CACC7CC7A777 

G7ACCCCAGGAA77GC7AACC7AC77GCA7C7GAA7GA7G7CCA7CAC77 

CCC77CA7CACC7CC7C7GGGGGC7C7GCAAGGA777GAC7CC7GCA77A 

G7GA7C7G7C7CACC7ACG77G7GA77CACA7GAAC77AC7AA7G7GC7A 

7G7GACAAC7ACCA7C77AAACACAAAAACCC7C7777GA77C7G7GGC7 

CCC7CCAGC7ACCCC7GCA777C7C7G7CCCCC7GCCCCG7C7C7GCAC7 

CAC7777A7777ACAGCAAAAC7AC7CAAGGGAG7C7CAG7GC7CC77GG 

C7CCA7G7C7CCACC777CA77C7C7CC7CAG77CAC7CC7G7CAGGC77 

CCG7CC7CAAGC7C77C77CAC7777G77C7AGGGCCGC7GACA7CC7C7 

77C77GCCAAA77CAG7GGCCAGG7CC7CAC77AC7CAAC7GC7CAGCA7 

TG77GGGCC7GG7GGACCACA77C7CC77CACCCACC7777GC7GC7C7C 

7C77C7C7CCAGA7G777C7C7C77C7CAC7GGC7AC7CC7C7777G7C7 

CC777G77AGC7CCA777C77CC77CCAACC7CAC7G7GC7GG7G7GCCC 

AG7GC7CAG77777AGC7A77C7C7C7777CCAG7GGCA77CA77AGA7G 

G7A7CA7G7GACCCA7GGCA77A7A7GCC77C7ACA7GACAG77AC7CC7 

GAA7A7GAA7C7CAGGAAAGA777GGA777A77777AA77AA7777777A 

AA7777A7777AA7AAA7GAGG7C7C7C7C7G7CA7CCAGGC7GGAG7G7 

AG7A77GAG7GA7G7GA77A7AGC7CAC7GCAGCC77GAACCA7GGGC7C 

AAG7GA7CC7CC7GCC7CAGC77CC7GAG7AGC7GGGAC7ACAGGCA7G7 

GCCACCA7GCC7GGA7GAC777T7G7G7G7G7G7G7G7G7G7GGAGACAG 

GG7CT7GC7C7A77GCCCAGGC7GA7CACAAAC7CC7GGCC7CAAG7GA7 
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CCTCTCACCTCAGCCTw .CAAAG7GC7GGGA77ACAGG7G7GAGACCA. J ? 

C7GGGC7AAGA77CAGA7777G7A77CAA77GAC7G777GACA7C77CAC 

77GGACACC7AAGAGG7A7C7CAAA7A77AA77AAC77GGCCAAAA7ACA 

GAAC7777GACCCC7GCCCCCACAA7AC77GCCCC77CCCCAGAC77C7C 

CA777C7G77AAA7A7CCCCAG77AC7CAACCC7CAAACC7A7GAA7GCC 

C777GA777C777C777CCC7CA7C7CC7ACG77GACGCCA7CAGC7AG7 

T77G77GCC777A7GCCCAGAA7A7AA7CC7CACCACC77C7C7CC7A77 

GCCCGAG7A7AAGA7G7CAG77777CC7GCACAG7CCA77GCCC7GACC7 

CC7GAG7GG777GC77CCAC7777GACA777G7A77CC7C777CCCCCAG 

GG7CAA77777CACAGCAAGAG7GGCA7777777777777777777777G 

AGACGGAG7C7CGC7C7G7CGCCCAGGCCGGAC7GCGGAC7GCAG7GGCG 

CAA7C7CGGC7CAC7GCAAGC7CCGCC7CCCGGG77CACGCCA77C7CC7 

GCC7CAGCC7CCCGAG7AGC7GGGAA7ACAGGCGCCCGCCACCGCGCCCG 

GC7AA777777G7A77777AG7AGAGACGGGG777CACC77G77AGCCAG 

GA7GG7C7CGA7C7CC7GACC7CA7GA7CCACCCGCC7CGGCC7CCCAAA 

G7GC7GGGA77ACAGGCG7GAGCCACCGCGCCCGGCCAAGAG7GGCA777 

T7AAAACCA7A7A77AGA7CA77GC7777G7G777GGGAACC7CCAAGGG 

C777GCA7CA7A7A7CAAG77GACACC7C7CC7ACCCAAGCC7GGC7C77 

TCC7GCTCC7C7G7CC7C7CAGCCCC7CCACCCA77G77CA7GC7GC77C 

AGCCACACTGGCC77C77GCCA7GCCACA777G7GC7AAGCCCACA7CCA 

A7C7CGGGGCC777GCAC7CGCA777CC7C7GC77GGCA7GC7G7ACCCC 

AGA7C777CA7GA77GGCAGC77C7G7ACA77CAGCCACC7GC7CAAGCC 

ACCC777CAGAGGGCC77CCC7GGCCACC7CACC7GAAA7AGCACC7CCG 

A77GCACCCA7CCGG77A77C7CCA7CC7G77C7C77GC77GG7GA7777 

C CA7 CAC7GA7GAGGAAA7GAACCA7GGAA7GC7AGGGC7GA7GAC CAGA 

AC777CCCCCACCCCCACA77A77ACAGAGGAGGAAA7GAGG7CGGAGG7 

AAGA7GGGCCCAGGA777C7AC7CCCGCC7GGAC7GCAGGCACAGCAC7G 

ACC7CAGC7G7GC7CAC7C77GGCA77CACCCAACCC77C7A7C7CCAAC 

7GCCCCA777ACCAGAAAG7GAAA7G77C7CAGAGACGG7GAGCCACC7G 

AC77GGACAGCAGCCCAGGGCCCC7GGCACCC7GC777C77CC7CCC7GC 

CA7CC777CC7C7CCAAGACC7ACC777CCC7G7GA77C77GCCCACA7G 

C7GCA777CA7GG7777A7GACC7GA777C7GAGAGGGA777GAA7777C 

A7GA77A777A7G7AAGCAAA7CA77A7GC77A7ACAAA7GAGAAAAGGA 

G7GC77C7GGAC77CCCAGGGACAAAA7C77G7CAC77GGC77GC777CA 

7A77GC7AA77AAGGACCCAGGA7G7GGG7GAGA7G7GC7AAAAGC7GAG 

AGGAGGC7C7GGAC7C7GAC7A7GGGCCCACACCCC7GGGCAGGCA7CAC 

AC7AG7CC777AGG7CA7CC7CAACCCAGC77CCAG77GAA7CAGA7G77 

7G7GAA7AAC7CAGCAAGGC7G7A7GGGAAA7GAAGAA7GAGG7GGGGAA 

GAGGCC7G7GCAGAAGACACAC7GAC77ACCCC7C7ACC7C7AAC7AGGG 

TG77G7AGCAGCCACCCACCCACCAAG7C7G7C77CCAGACCACG7A7GC 

777CC7CCACC777GCA7C7777A7C77C7GCCAGCCCAGA7GC77GC7G 

AC7CCAGCCCAAGCC7A7AGGA7AAGC7ACAGCC7G7CCC7ACAGAC7AC 

GCA77GCAGAA7C7AAGACA7CAAG7CAAG77CGGAAGCAC77GCC77C7 

CC7C7CCAGG7ACACAGGC7C7CC7GGAAAGC7GG7AGCAGC7G7GGAGG 

TG7GG7G7G77ACC7GC7GCAGG7GCAGAGAAG77GAC77CACAGCCC77 

CAGAAAGAC7GCC77C77CCAG77G7A777G7G7AC77GC77GGG7G7GG 

GGAGGA77C7CAGC777C7CCAC7CAAA77A7CAGACCC777CCA777AG 

7GG7AGACCA777CCC7CG7CCAGGCCAAGGGCACA7AG7ACAGAGAAA7 

AGGGAG77G77ACCCAGGGAGAGAAC77GGC7C7AAACC7G7AA7AGAAA 

GG7CAG77C7GG7C7GGAGGG7CAA7777GA7C777GGC7CAGA7CCAGG 

AA77GG AAC CAAGGC7777GAACA7777AA7GCAGGGGA77AAAAAAA7G 

A7ACG AG7 C A77CACGAA7A7A777G C77AACA7C7AAAGAGA7 C CC7CA 

AAACAC 7 AG AAAAAA7 AAGAACAAAAA7 C 7 AA7 AAAACAAAAT77G 77 AA 

ACACA777ACCAAA77777777777GG7AAAAA77CAAA7G7CA7AAA7A 

AAGC7AAAG77CC7C77GA7GAC7CGC7CC7C7GCCC7A77CCAC7CCAA 

G7AACCAC7A77A7CAG7C77GCCAA7ACCC77CCAGACC7C7C7ACC7C 

7A7A7ACCA77AGAAGCACA7GG7777GCA77GAGGA7G7GCAG7G7777 

G7777ACG7AAA7G77A7CAC7C7G77C77G77CCA7AA777GCC77777 

C7C7GAA7GA777GC77GGC7A7C777C7A777CAG7AGCA7C7CC777C 

T7777 AAC77 ACCA77G777 A777AAC C77G C C7 C7 A7 C AACAG A7A7G7 
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AGG77G777C7AG77GA * 77CA77AAG7A777A7AAACAACG : CA7CAG1 A i? 

3A7G7CCA7AAA777C777ACGGAAGA7GGCAAG7AG7GGAA77GC7GAG 

CCAAAGAACA7G777AAAAAACCCAAAAAAAC7AGACGC7ACCAA7777C 

TC7CCAAAA7GGCCA7ACCCAC77ACCCA7ACAGAGA7GA777GGAA7C7 

GGC77CC7CACAAGG7GAGA7GCC77CACAG777CA77C77CC7GGCA7G 

7C77CCC7777G7A7C7GAGAGAGC7GGCAGAA77G7G7CAC7AAA7CAA 

GGA7AGAGGG7CAAA7GACAGC7CAAGC7CACAGGCACC7C7GC777C77 

CCCAGACCACC7GC777CC7GCCACCAGC7C7G77CCA7C77A7AGAA7G 

G77GCCAC77GGG7G7C7GC7CCGACAGCCA7G7CA7CC777GCAC7GCA 

G77A7GAAGCAGACAGAGC7AGGAGAGGGGC777GCCAGCC7C7GCCC7A 

GC77GGAGAA777CAAAGAAGGAGGG7A77GAGAG7GAGC7GCCGAAGAC 

7GGCAGC7CCC7CAAC7CAACAG77G7CC77CCACAAGAAG7CAGA7ACA 

T777777GGGA7AAAA7A777AAAAA77A77A7777A777C7GAA7AA7A 

7A777ACA7GA77CAAAAA7CAAAC7G7AGGCCAGGCA7GGC7GC77A7G 

CCTG7AA7CCTAGCAATTTAGGAGGCCGAGGCGGGAGGATCACTTCAGCC 

CAGGAG77CAAGACCAGCC7GGG7AACA7AG7GAGACCC7G7A7C7ACAA 

AAA777AAAAACAAAAA77AG77GGGCA7GG7GGC7GA7A7GG777GGC7 

C7G7GAC C CAAC7CAAACC7CA7G77GAA7777AA7C C7CAATG77GAGG 

GAGGGTCC7GG7GGGAGG7GA77GGA7CA7GGGGG7GGG77CTCCC77GC 

7G77C7CA7GA7AG7GAG7GAG77C7CACAAGACC7GG77A777GAAAG7 

G7G7AGCACC7CCCCC77CAC7C7C7CAC7C7CC7GC7CCGCCATAG7AA 

GA7G7G7G7G777CCCC777GCC77CCGCCA7GA77G7AAG777CC7GAA 

3CC7C Z CAGC7A7GC7TCC7GTACAGCC7G7AGAAC7G7GAA7CAG77AG 

ACC7C7777C77CA7AAA77ACCCAG7C7CAGG7CA77C777A7AGCAG7 

G7GAGAG7GGA7GAA7A7AG7GCCA7A7G777G7A77CCCAGC7ACCCAG 

GAGGC7GAGG7AAGAGGA77GC77GAGCC7GGGAG777AAGGC7GCAG7G 

AGCCA7GAC7G7ACCAC7GC7C7CCAGCC7GGG7GACAGCGAGACC77G7 

C7CCAAAAAAAAAAAACCCAAAC7G7G7AAAA7G7G77CA7AAAAG7G7C 

77GC7CCCACACC7G7CCC7A7A7A7C77A77CC7CAGCC7CCGACAAC7 

ACT77A77CA777C77A7G7A7C77C CAGAA7CAAAAAAAAAAAA7CAAA 

7ACAAGCACAG7GGAA7G7A77GCCC77C77CCCC7CCC7777G77ACA7 

CAGAG77AGCA7A7CA7AAA7ACGG7C7GCA7777C77C77777CAGC7A 

7CAGCA7G7777GGAGAGGA777CA7A77CG7GCAGACAGCATG7A77AG 

7CAG7CC77GCA77GC7A7AAGGAAA7ACC7GAGAC7GCA7AA777A7AA 

AGAAAAGAGG777AA77GGC7CACAGC77CGCAGGC7G77CCACAGGAAG 

CA7GGCAGCA7C7GC77C7GGGGAGGCC77AGGAAGC7777AC7CA7GCA 

GAAGACAAAGCGGGAG7GGA7G7C77A7A7GGCAGGAGCAGGAC7GAGAG 

AGAGAGAGAGAGAGAGAAAGGATGCCACA7AC7777AAACAACCAGA7C7 

7G7GGGAAC7C7G7CACGAGAACAGCACCAAAGGGATAGTGCTAAACCA7 

7CA7AAGAAC7CCACCCCCA7GA7CCAA7CACCCCACACCAGGCCCCACC 

7CCAACA7CGGGGA77ACAA777GACA7GAGA777GGGC7GGGACACAGA 

ACCAAACAA7ACCAGAG7GC777C7CA77C7777C7A7AGC7GCC7AG7A 

77C7A7G7CC777AC77CA777AGGCAG7C7C77G77GA7AGACAC77GG 

G77AC77CCAA77777CC7A77ACAAA7GA7G7GCAA7GAA7AA7777GA 

7CA7777CCA777CACA7GGG77A7G7CCA7C7G7GGGA7AAA7C7CCAG 

GAG7GAAA77GC7GGA7CAAAGGGGAAG7GCAC77G7GATT77CA7AG77 

AGCAAA7777G77C7A7AAGGG7CA7A7GAA777A7AG7CCCACGCG7AA 

7A777AACAG7GGGGA777CCCGACAG777GACCAACAAGG7C7G77G77 

AAAC7777GA77777G7CAA7C7GA7GGGAAAA7AC7AG7A7C7CAAAG7 

GC7777AA777GAC777C77A77ACAA7G77AAGCA7CA7777AC7C7GC 

CCAAGA7CAAA7AG7A7777C7777C7G7GAACAGAC7G77AAGA7CCC7 

CAG77C777ACA7G7GAAACAAG77A7C7C777A7C7GGGG7G7GAG77A 
CAAC7AC7777CC7C7GGC77G7777GCGC777GAC777GC77C7GG7GA 
T7CCCGCAA77C7GAAAG7G7AC77777GCA7CA77CA77C77A7ACACC 
CA7GC7C77G77CACGC7GG77CC7C7ACC7GAGGGC77777C7777C7G 
C77C7A7C7GGGAACA7777777GAGAGAGAG7C7CAC7C7C7CGCCCAG 
GCTGGAG7AG7GCAA7GGCGCGA7C7TAGCTCACTGCAACCTCCACCTCC 
7GGG77CAAGCAAT7C7CCTGCC7CAGCC7CCCAAG7AGCTGGGA77ACA 
GGAGCCCACCACCAAGCCCAGC7AA777G77GA7T7AT7TAT77A77777 
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GGCTCAAGCGATCCACCCACCTCGGCCACCCAAAGTGCTGGGATTACAGG 

CGTAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTGTTTATCCCTT 

TAGGCTTGATTCTTATGTCGTTCTCCTCCCTCCTTCCTGGATACTCCTCT 

TGTTCTTTATCTTACTCTACTTGTCATGTTACCTTGTTTCTGCTTATAAC 

TAGCTGCCTCTCCTATCTGAGGAGGGACTTGTGACTGTTCTCATCTCTGT 

AC7CGCAGCTCCTAGTACATAGCGCTTGCTCAACAGATGTTTGGTGCATT 

GATAGATAAATCACTGGTAGCTGTTACTACCAGTCCTGACTCCCTGCAGT 

GCTTCAGCTGATCCTGTTCCAGATGTGCACTGAATATCCTTCTGTTGAAC 

AACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGGTGGCCAAGGA 

TATTTTTAGGTACTTTGCAGCACTCAGCAATGAGGAGTGGGCTTTAGTCC 

CCCAAGAACTCTCACAGCCCTGGGTGTCTTTACTGTTCAGTGTCAAATCC 

AAGACAAGTCAATGATCAGGAAAGACCATTTTTTTTTGTTCAGTGAAGTT 

TATTTGAGAATCATTGAACAGTATGATATTTGGTAATTTCATAAATATTC 

CCACTTAAAATGATCGGAGCAGATATATTTTCAGTCGTAATTAAAGGACA 

TGATTTAAAGAGAGCACACCAGTCCAAATTGAAATGATTCCATAGCTATT 

AAAAAACTAGGGTTTTTTACAGACAATGATAC7TTTTGCCCCCTTTGAAT 

AG ATT AG AC C AATG AAT AAAACAAACAAACAAAT AAAT AAAT AAAT AGGG 

AAGCGGTTGCTCATCAGAATGTGGGAGCGAATGACAGAGGGTTTCTTAGA 

ACCAAATGTGGCCGTGGTTTCTGTCAGGCGTGCTTTAAGTGAGTAGGAGA 

GGTGAGAGAGGCCTGGCTCAACAAAAGGGCTGGGGATTGTCCCTGAAGAA 

CCAGAGCTGANTTNCATCAGGAGTAACANAGGTAGATAG 

>Concig41 

CCGCGTTGAGGTTCCACGCAGTTCAAATTATGTCCAATTATCAACATTAA 

TGCACATTTTCAATAGAACCTGTTCCGGCTTTTCTTAGGAGGGGGGCGGG 

GAGACGTTGTTCTCTGGGAATAAGTGTACGCAGGAGGCTGAGAAGGCTTC 

ATTCCATAGCATTCACTTACCTCCAGCTGTAGAGTGGGCTTATCATCTTT 

CAACACGCAGGACAGGTACAGATTTTTTTCTTTGAGGCCCAAGGCCACAG 

GTATTTTGTCATTACTTTCTTCTCCTTGTACAAAGGACATGGAGAACACC 

ACTGAAGAAAGAAGGGGGTCTTGTGGTTAGGGACACAGCAGTGCAGGGTC 

ACCCCAACCCCTAGGCCCCATGAGTAGGATACATGTAATTTGGTAGCCTC 

TGTGGGAACCCACAGTGAGGTTCCTTGGCCTAAGACACAGGATAACTTGA 

CTTCTCACAGACAATAGCAGGGTCATTTTGTTGATTTAGGGTTTCCCCTC 

AAAGGCCTGAGGGTTTCTCAGAGCCTCATAGCAGTAGGAACGGAGAATGA 

AAG AG G G T C T A C ATTTTAAATG CTG AAGG AAGG AAGGAAGG AAG C C ATTG 

TGTCACTGGCTGGCAATGTGCCCATCCACAGGAGCGGAACAACTTGATCA 

ATGTGGAAGGAAAGGAAAGAGGTGAGGCTGTACTTCTGCCAGAAATCAGG 

GACCAGAACTGTTTCAGGAACAGAGAGTAGCCCATGGGAAGAAACTGGGA 

GAGGAGAGGCTGAGCTGGGAAAGTGGCTCCAAAGAGAGACACTCATTTTG 

ATCTTCCTCAGTCACAGCAGTGTCAATTGGAGGCCCTGGGATCACTCTTA 

CTACCCGATTCCAAAGAAACAGGATTTTCTTGGCCTGGCTGAGAGCAAAT 

AGCTTCCGCCTGAGTGAGGCTGTCCTTCAAAGTCAGCAGCCTTAGTTGCC 

CACACTCCTGTGCAGAGGCTTTGGCTACTGTGGCACGATGCCAGGCAGAT 

CACCACAGCTAATGATGGGTTCACCGCACTTGAAACTTT T GCCCGTTACA 

GCGGAGAGATATAAGTTCCTGCTGGGCGGTAAAATTTCCCTACAAGGAAC 

CACCTGGCATTGGGTGGGACGGATGTTGGGGCAAGGGGGGAAGACTGGGG 

AGGGGGATGGACACATTATCGCTCCAGCACTCTTGTTTCAGCCTCAACAA 

CAGGAAGAGAGAACCCACAGGCAGTTAGGCCATGTCCATCAAATGACCCC 

ATATTGTGGAAGAATTGACATTGCACTATGCCCAAGAGACTTGGGTGGAC 

ATGGTCCTGGGAGTGCTTGAGCCGTCTAATTTCTCAGGGTCACACTCCTG 

TTAACAAATGCACTGGCCAGTGCAATCAAATGTGCCATTTCTAGGACCAA 

AGTTTGTATATTCCTTTTTAATATTTTTTTTCACTTGTGTTGATCATTTG 

CCTTAAATTAACTTTCTACTTTGTTTAAAACATGGAGAATTAGCAAGCTG 

CCAGGAGGCCAGGCAGGGAAACCAGGATGTTTCCATTTACCTTGTTGCTC 

CATATCCTGTCCCTGGAGGTGGAGAGCTTTCAGTTCATATGGACCAGACA 

TCACCAAGCTTTTTTGCTGTGAGTCCCGGAGCGTGCAGTTCAGTGATCGT 

ACAGGTGCATCGTGCACATAAGCTTCGTTATCCCATGTGTCGAAGAAGAT 

AGGTT GT GAAATGTGGAGCACATGTTGTTTAGGTATAAAATCAGAAGGGC 

AGGCCTTGTGAGGCGAGGTGGCAAAA7TTGATTTCTTGGAGGACACCTGA 

GCATATACGGTCAAAGTCTGATGACAACACCAGTAGGGATGAAGCTGGGA 
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G7GGGG7GGC7AAGAAC. .CTGGACCTGACACTATTAGACATGGG^G'C-.-iil 

CTTCAGGTCTATTACTGCTCACTGTGGCCGAGCAACAGAGCTACTTAGGT 

AAAATGGTGATGGTCATAACACTAGCCCACAGGGAGGTTACGAACCTCTG 

G7GAC AA7 GTAAGTGAAAGG C C C CTG AGAAAG AGTGAGGGAGTTG CAAAT 

GT C AGTAG C CATCAAGATCTTCTTTAAGAAT AGTTT C CAC TAAAGAGATG 

ATTGCTTTGGTTTCCAGCCTTCTTTGTTTTGTCTCCCCGCTGGGCCTTCT 

ACCTTTAAAGGGCTTTGGCTCTGGGGdAATTGAGTTGGCTGGGGCTTGAT 

GAC77CCAAGAGGACACAAG7GGAGA7C7AC7GCC7GC7C77GGC7AAC7 

ACC77C77CAAAGA7GAAGGGAAAGAAGG7GC7CAGG7CA77C7CC7GGA 

AGG7C7G7GGGCAGGGAACCAGCA7C77CC7CAGC77G7CCA7GGCCACA 

ACAACTGACGCGGCCTGCCTGAAGCCCTTGCTGTAG7GGTGGTCGGAGAT 

7CG7AGC7GGA7GCCGCCA7CCAGAGGGCAGAGG7CCAGG7CC7GGAAGG 

AGCAC7GCGGAGAGAGCGAGGGAGGGAGCC7GG7GAGG7GG7CC7GCCAG 

GAACCA7GC777GACATCAGAGAG7AGAAAGC7CAGAGAGGAGGAAAGGG 

C77GAAAGAA7CCCGAGCT7CTAAAGA7CA7CCC7C7C7GGGCCAGGCG7 

GG7GGC7CA7GCC7G7AA7CCCAGCAC777GGGAAGCCGAGG7GGA7GAA 

7CA777AGG7CAGGACT7CAAAACCAGCC7GGCCAACA7GGCGAAACCCC 

77C7C7AC7AAAAA7ACAAAAA77AGC7GGG7G7GG7GGGG7GCACC7G7 

AA7CC7AGC7A77CAGGAGAC7GAGGAAGGAGAA7CGC77GAAC7CAGGA 

GG7GGAGGATGCAG7AAGCCAAGAT7G7ACCACTGCAC7CCAGCC7GGGC 

AACAGAG7GAGAC7C7G7C7CA7AAAACAAAACAAAACAAAACAAAACAA 

AA7AAAA7AAAA7AAAA7AAAAAGA77A7CCC7C7C7GAAGC7CAAGGAG 

GTTAAGGG7G7ACTCAAGGGCACACAGCAGG7TAGAGGCAGACTCAAGAT 

7AGAA7G7GGGC7TTCTGACACCT7ACAGGC7ATTCTTTTAGAATAAATC 

CCATTTCTACTTTGTTCATCTTTTTTGTACATGCCCCACCTACACCATAC 

A7G7A7ACC77C7C7ATA7C77777G7A7CCC7AA7GC7G7CACAC7A7G 

A7T7GC77T77CA7GCAGA7GACCA7AACA7777CCA77CACC7A7GC7C 

AC7CAGCAAG7A77CAA7TTrTCTACAC7G77C7T7777T7CC77777CA 

7AACAC7G7C7CA7AGGCA77C7GCAAA7CC7G7GAGAGTAC7777TGTG 

AAA7G77ACCAC7T7CC7CT7A77CAGAGAAGC7CCG7ATTAAGGCTTCA 

CTGAGG77GCC77AAGGCA7GA7AA7GG77CAAAGGC77GAAAGACAG77 

AAAGAGACC7G7AAG7GCACAAAAGAAAG77GAGCAGGAGAGAA777CC7 

GCC7GGAGCAGAGCCAAGCTGC7GGAAGAGGCAA7GGGGGCAAAGGCCAG 

GCAGACAAGCCAA7GGGCTCCTCCCACAGC7GCAGCCAACAAG77A7GCC 

AG7C77AAAAC77C7AAAGAAA7A7G77777AACAAGA7TGAGGAC7GGA 

7TA7GAGGC7AGGGGAGGC7A7CACAAAC7GGAA7AAAA7AAAGCCAGAG 

AAAAG7GGC7GCC77CCAACC7GCACAAC7GACC7AGC7AGGC7GA7GGC 

7GGGCCACC7AGGAAGGC7AC7GAGCA7CA7A7AAAACAGAAGGGACAGC 

AGGAA7A7AACA7GGCTC777G7AAGGA7GAG7C7GAAAAA7GACCA77T 

GC7GCCCAAA7GCCC7TAGC7ACAAC7GAAAA7A777CAGAAC7GGAGG7 

7GCAGGA7GC7GGAA7C7CAGAGA7CA7CCAGC7CAGCCC777AT7T77C 

AGA7GAGG7CCAAAGCGGGTAAAA7GAC77G7CAAGG7CAAACAGCAAG7 

GAA7GG7777C777CAAGTC7CAA77CA7C77TTrG777ATA7CA7C7A7 

G7C77G77G77A7AAGC7TCACCCCAGG7AGCAAAAAACTA77C7AC7CA 

AAAGGGG7AGACA7A7G7TAG7TC7CAAGA7CA7C7C7TGG777CAGAG7 

77AAC7CAAG7GA7TGGCATAGGC7GAA7CCA7C7C77AAAAGGA7AATC 

AAA777A7G77GAAGAC7TGGT7G7C77CC7AC7A7GAAA7GGGAAACA7 

7A7CAC7AC7CC7CCCC7GTCACCACCAAG7G7GGCCACCACCACCAACG 

T7AG7GAG7GAC7G7GG7GA7A7GA7GACCAAG7GGCCAGG7CAGCAAG7 

GG7GCAGCC7G7G7C7CACTGGAAGAGG77AAAG7C777CTAAAACAAAA 

7ACCA7GGCA7CAAAG7GGCCCAGAAC7CCC77C777GAGC777CCC7G7 

GTTAGAGCCCTTCCTTGGGTTGGGAGTTAAACCCATAGTCTTACCTTCAT 

C7G777AGGGCCA7CAGCTTCAAAGAACAAG7CA7CC7CAT7GCCAC7G7 

AA7AAAAACAGGGACATGTC7CAA77A7G7C77C7AAACAGG777ATT77 

7CC77CCC7G7G7ACAAGAC77GAC7G77CA7AAGAAAC7GCAAACAGCC 

7GCC7C7CAAAGC7GCC7GAAACACC7GGCAAG777CACAG7GA7A7GCG 

CAGAACAG7CCAGAAGGCAGA77C7AGGCC7GGCAGG7GGGCACCC7GGG 

7GC7CCCTG77GGA7C77GAGGCC7AACC7C7AGCCCAGCAGAG7CAGC7 

AAAA7CTGAGCTCTCCCTCTCCCTCCAAGCCACAC7TTGCAAAGGGATTC 

CTTG7A7TGTGGGCTTGGAATCTTT7C7CCCCA777GCCTCTGCAGGAAG 
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CCC773CAACAACACA TGGATAGCCTCCAGGTCCCAAGGCT<S?MGC» J 

C77G7AA7GGGAAAG7AG7C777AAA7CAGA777AC77GGCACCC7G777 

GCCAC7GAAAGAGGCAA777AGGGGAAAAA7C7GG7C7CCAAGCACAGA7 

AACAC7C7AC7C77GAAAGAGGAGACC7GC7CA7G77AC7GG7C7CAGCG 

7C7CCAC7GACC7G7AA7AAGCCA7CA777CAC7GGCGAGC7CAGG7AC7 

TCTGCCATGGC7GCTTCAGACACCTGTGTAAAAAGGAGAAAATGAGTGAC 

77CCCCA7GACGGC7ACG77CA7G7G7GA777C7C7CAGCA7CGAG7GCA 

7GGCAG7CA7GCAAAGAAA7GA7C7C7GAG7AAA7GAA7GAA7G7G7GAA 

AGAGAAG7CC777GGG7C7AGAGAAAAGCA777GC7AAACCAAACCCCAA 

CTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTTGACACTAACC 

TTTAGGG7GTCAGCTGTTAGATAAGCAGTATCCATTCCCAGAATATTTCC 

GGAG7CA7AAGCA77A7A77ACACC7GGCA77777GCAAAAAGC7GAGAG 

AGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAGAAAGAGAGAG 

AGAGAGAGAA7A7GCA7ACACACAAAGAGGCAGAGAGACAGAGAGAC7CC 

C7TAGCACC7AG77G7AAGGAAGA77AAAG7CA7AC77GAGCAA7GAAGA 

77GGC7GAAGAGAATCCCAGAGCAGCC7G77G7GCC77G7GCC7CGAAGA 

GG777GG7A7C7GCCAG777C7CCC7CGC7G77777A7AGC777CAAAAG 

CAGAAG7AGGAGGC7GAGAAA7T7C7C7G77GAA7ACC7GA777CACAA7 

CAAG77AAAGGAAAGGGGAAAAGAG7A77GG7GGAAGC77C77AGGGGAG 

GGGAC7AA7AAAC7GAGA7AA77C7C7GG77CA7GGAAGGGCAAGGAG7A 

GCAAACTATGACACATTTTGCAAATGTATCACCATGCAAATA7GCA77GT 

T77CC7GACAA7CG77G7GCAG77GA7G7CCACA77AAAA7AC7GGA777 

7CCCACG77AGAAGAATGTTTAAATTTAG7ATATG7GGGACAAAG7GGAA 

GACACACAGATT7ATACATGCACATACTTTTCTTCATTCAC7TC77TG7A 

C77AAG777AGGAA7C77CCCAC77ACAGA7GGA7AAA7GGG7ACAA7GA 

AGGGCCAA7AGCCC7CCC7G7C7G7A77GAGGG7G7GGG7C7C7ACC77G 

GG7GC7G77C7C7GCC7CGGGAGC7C7C7G7CAA77GCAGGAGCC7C7GA 

GGAGAAAA77GACC777C77GGC7GGGGCAGAGAACA7ACGG7A7GCAGG 

G7TCAGGC7CC7GACGGAG77GGGGCAACCC7GGAGA7AAGC7CACACAA 

CCC7GCAAGACCAGG7GC7G77ACCC7AGCCAA7C7CA7GGA7GAACCAG 

ATCAA7GCCAGA7GAGC7C7GCC7AAAA7GA777777GG7GAAC7C7GAA 

AAG7GGAA7A77G777C7G7AAGAA7A7CCA7C7GAGAC7C7A7C7C77G 

G7AA7ACCAAGAG77A7CAG777C7C777AACCGAGACACCAGCAAAG7G 

CC7GC7CCAGGG7AA7GCCCAGGGGAGCCC7CCA777G7AGAA7GAA7GA 

GAG7CCAGG77A7GAACAG7GCC7GGAG7G7AGGAACACCC7CC777GCC 

7C777GACAGG7C7GCA7CA7AACAC777777777777777GAGACAGAG 

7C7CAC7C7G7CGCCCAGGC7GGAG7GCAG7GGCACGA7C7CGGCCCCC7 

GCAAG77CCGCC7CCCGGG77CACACCA77C7CC7GCC7CAGCC7CCCCA 

GCAGCT3GGACTACAGGCACCTGCCGCCACGCCCGGCTAATTTTTTGTAT 

7777AG7AGAGACAGGG777CACCA7G77AGCCAGGA7GG7C7CGA7C7C 

C7GACC77G7GA7C7GCCCGCC7CGGCC7CCCAAAG7G77GGGA77ACAG 

GCG7GAGCCACCG7G7CCAGCC7G7AACAC77C77A7AGCAC7GAG77GA 

AACC77GC7CC7CC7GG77CC7CCAGGAAAC7GAAA7C77777GAGCCAA 

G7C7AGCACAG7GCC7GGCA7G7ACA77CAGG7GG7AGAG777GC7GC77 

GAA7GGG7GAA7GGGAA77TGACAGCA77777A77CAAA77AG7A7G7GC 

CAGG7A7CG7GC7CGC7C7GCA77A7CCAAGGGAG7GAGCC7C7G7GCAA 

GTA777GAGACACGAGGGAAA7AGG77C7AC7G7GGGAAAAAGAGCA777 

CATGGACTTGCTCTCCAAGCAGCCTTCTGATTTTTAATTTGGCTCCCAGT 

A7C77GA7A7CAGGAG7CAG7CACAAGAAC7CCA7C777AG7AAG77A7A 

77T7CCACAGGAAA7C7AAAAGC7G77CAACA7G77AG777CC7G7GAA7 

77GA7AAGCCA7AA7CCA77CC7AACAC7GAGCCC7CC7GAAA777GG7G 

TC7GG7CC7GCAGA7AGC7AAAAGCCC7G7C7GGG7GGCC7AGGGAC7CC 

7C7G7777GCC7CCACAGGA7CCAC777GCAAA77AACCAC7GG77C7CC 

CGT7G7AGGAAC7GCCACCTTCC7CAGAGCC7G7C777C77CC77CC77C 

C77CC77CC7C777C77777C777C7C7C7C7C777C777C7777C7777 

C777C777C777C777C777C777C777C777C777C777C777C777C7 

7CC777C777C7C777C7C7C777C7C7C777C7C777C777C777C7C7 

G7CCC7CCC7CCC7C7C7C7C777C777C77777C777C7777C7C7777 

C777C7C7C777C777C7CCC7CCC7C7C7C7C77777C777G7C7C7CC 

CTCC3773TCTCTC7C7TTCTCT7TC7CTCTCTCTCTC7CCTAGACAGGA 
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TCTACCTTTATCCCCC^.^CTGGAGTGCAGTGGTACAATCATGGATTC^i' 

TGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCTTTATGCGGCAA 

CCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTCCTGACAGAACA 

CAGAGATGTCTTTGGTCTGTTGATGTGATTACAAGCTGAACGAAGGAAGA 

TCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCCGAGCATCAGCT 

CTGAGTTAGTCCATTCTGCTTCTGGGACTTGGGATACAGGTCAGAAACCT 

TGAGCTTCTACTTCTCCATCTTCCAATTGTAGCATCCAGGACCTCAGAAT 

CfGCCAGCTAAGAGGAGCCCTAATGATTGTCTGGTGGGATATGGTGGGAC 

CACAGAGATGAAGACATGAATAGCTATTTGAATGTGAACAGCAGACGAAG 

AAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAGCACAGTGTGGT 

TGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGATGCTTTCGCTCG 

AGGGAAATTTTGGAGCCATGGGGCAATGCCCCCTGACGTAACAGTCTCCA 

CAGTTCTGCCATGTCTCATCCTGGCCCTGTAACCTGGACCCAAATCTGCT 

ACCATCCCATCCATCTCAGGAAGTGAAACCTCTTATGTCAAATAGGTTGT 

GCAACGTATGTATCAGATCCTGTCTTCCCAAGGAGACCGCTCAGGCCACA 

GCACTTCCTTCCGATCCCCAATGAGCAGAAAATATCTCGCTATAAACATA 

GTTGGCACTAAGGGAGGGAGTGGAAGAGTGATGATGATGTAGATGGTGAT 

GTAGCCCCAAGGAAGTGGAACAAGCAGAGATGGGGAGCTGGAAATGCCAG 

GATGCTCCAGCTTTTGGGGAATTATTCAGCTCTTGAGTCACTAAAGCCTT 

TCTCAGCTGCAAGTTCCTCTTTACCCTGTCAGGTCATTCTTCCAAGACAG 

GAGACTGACATTTATTCAAAGCAGCAAGTGCCCTGATACCATCTTGTGTC 

TAATCATGGGCTTCGCAGCCAGTTATCAAGGTTGATCTCATCTCATTGGT 

CZTTCAATCATTTTGAACAAGAAGACAAGCAAAATAATCATGGGTTAGTTC 

7TATATTATTGTGTGTACATGCAGTGATGTCTGTTCTTTGTAGTGAGCTG 

TTCCTTCCTTGTTCACCCTCTTGCTTAGAACAGAACTAAGCAATCTGCCC 

CCAACATTTTCCCCAATTTCCCATCTCATTCTTGGCACTGGCTTCCTAAT 

ATTTGTTCTTATGAGTCATTTTCTTGTATCATTTCCATGAGTCCCTCTGG 

GATCTTAAAGTATGAAAAATGTTGTGTGTACCCACACCTGTCTTTGTGGA 

TATTTCTCTCCTTTCCCTTCTGCTTCTGGGATTATTTGGGAATGGGCACT 

ATGATTTTTATCATATCGCTTCCACTTCCTTTATGGCATCATCTCCAATG 

GGCTTCTTCTCCCTCTTGGATCCAGGTTCTCAGATTGGGGACATGCAGAG 

TCCAAGGAACATTCCATTCTCCTCCCTGGTCTAGAACAAGGAGGGCTTAG 

ATATATGAGCAGGTGGCTGGGGCTGGCGAGCTATGTAGTCTCCAATGGCT 

TTTCCCTGATGTCGGAGTTGTTATGTCAGTTCTGGGAGACCAATAAGACC 

TTGTCCTTCCTTTGGATCCATCAGAAAAAGCCCCTGGGTGGGTAAGATGG 

ATGGCAGGGCTCTCCTACTCTATGTCTTTTCTCACACCTAGTGGGTATAA 

GAGAGGGGACCACAAACAGAGGGGGCTCTGGTACCACTTATCCAGGGTCT 

GGAAACATTTTCTGTAAAGGGCCAGATAATAAATGTTTCAGGTACAACTA 

Z T C AAC C TTG C AT C AT TTCAGAAAAG CAGT CAG AT AATACAT AAATGAAT 

3GGTGTGGCTGGACTTGTCCTGCGGTCCCCTGTCTTATATCATTGTATTA 

TATCATTTTTTCTTACATACAAATTTAGAAGCAATACTTAAAAAAAAAAA 

ATTATCTTATTAACTCTTCATAATAACCTTTAAAGTAGATAATATTGAAC 

CATTTGACCTATGCAGAAACTGAGGTTGAGACAATAAATTATTTAAGACC 

GCACAAACAGTAAATG CTGG AACTACGACTCAAATATGGGTTAACTGAAC 

CAAAACCAGATCTTTATTTCTCACTTTTAATTGTTACATATGTTTATTGC 

CTCATCTCCTGTCCACATGGTGCCCATCGGCAGACTCCTTTCTCATTCTC 

AGTGATTGAGTGACATTCTAAACTACATTGGCCTGGCAGATTCACCTCTG 

TCCCCTAAATGTTTCCACATTGTCCTTTTAGGATTGAGATCCTCTCTGTT 

CCCTTGTCTTCCCTCCTTTCTTCTTCTGGCGGTGACGTGCTGTGTGAATT 

TGTTTCTTTCTCCTCTCAGGGTAGTACTGGGACTTTCCAAATCAGGGTTT 

TTAGTGATCTCTCTTCCCTTTTCTGAGTTTCTTCCTTATTCCCATTCACT 

TTCTCATCTATAAGTGGCAGCTTTGTTGCTGGAGGATTTCCTTTGTCCTT 

TTATTCTTCTTTAAGACTTTGTCATAACTGTCAAAAGCAATCCCTTGAAG 

GTATCTGTCCTTGGAATTGTGTGCTTATGATGCTGAAAAATACTCTCTTC 

CTAAAGCTATTATAAATGCT 

>Concia42 

GGCTAGCTGCAACTCTTGAATACAAACACATTCAGACATGCACACACTTT 
CTGGCTCCCAAAAAGAAAAAAAAAAATCAATTTATAATAATTCTGATCCT 
TTGCTTATTTCCACAAACTCCATGAAAATTGTACATTGTCCAAGCAACAT 
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TTCTTAATATTCTCTTi . TCTCTCATATCCATTTTCCTTACTGC^SirC * w - : 

CACCTATCTCTTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTT 

7A77CAA77T7G7GGAATGGAGGC7GCAC7GA77TAAA7TAAAAAAAAAA 

AAAAAATCCCTACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTG 

TTTTCCTGAGACAGGGTCTTGTGTCTTCCATGC7GGAGTGCAGTGGCATG 

ATCATGGCTCACTGCAGCCTCAACCTGCTGGGCTCAAGTAATTCTCTTGC 

CTCAGCCTCCCCAGTAGCTGGGAGTTCAGGTATGTGCTACCATGCCTAGC 

7AA7T77T77C777TAT777G7AGAGACACGG7C77GCCAGG77GCCCAG 

GC7GG7C7AGAACCCCTGGGCGGACG7GA7CCGCC7GCC7CGGCC7CCCA 

AAG7GC7GGGA77ACAGGCG7GAGCCAC7GC7CCCGGCC77GGG7GCAAA 

TTTGAGCTTTCTCACTTATTAGTGTAAGACATACAGCTAATTTCTAAATC 

77CCAAACC7CAGATTT77CA7CCA7GAAG7GAGGA77A77A7AGAGC7C 

ACTAATAACATGGCTTCAAAAATATATAATGCCAAAATTGAGATCAAAAT 

AATAAATCTATATTACATGGGAGATCTTAATGTACCTCTTATATTATTGA 

TAGACTAAGATGATCAAAAAAATAGAAAGAGAGCAGTAAGGAGAGCAAGC 

ATTTAATCAATAGGACCAATACATTTTAATCAATAGGATCCTCAGGAATA 

TATACAGAATAC CAAACCTAACAACTGCAGAAAACATGC CAAACATTTAG 

GTACAGACATTGTTGGAAAATGCAATCTTGAAACGAGTGGACTGACATTC 

AGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAACCATGTCTTTA 

CTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCACATAATTTCTTC 

CACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGC7TACTGTCTAAT 

A7C77GGCCCC77AAAAG77GC7AA7C77CCAAACC7CA77CC7G7GAC7 

GGGCCGC7GG7CC77G77CA7GGGCC77GAAAA7AC7GAC7G7ACAC77A 

TC7GGAGCA7CCAG7GCC7ACCACC7GACCCAGA77CC7CA77GCGC7CC 

7CCC7CCTCCACCTATTGGAATTTGCTCATACCCGTGTGAGACCCCTCCC 

777CCCCCCA7C7GAA77T77A7CAAGACAACGCAC7GCCA7AC7CCC7C 

G7ACCC7GC7C7GGGCA7CAGAC7GAA7G777G777CCA77GAGGA7C7G 

CAGC7GCA7CAG7TTCCCCAGCACCG7CCAACCCC77GAGCA7GGC7AG7 

CC7AAAGCAGAGAATTAGCC777C7A7CCC7GC7GC7A7ACA7GC7GGGA 

CAAA7AA7AAGAAA7GACAGCA77TTA7GA7AA7GCAGGC7GCAGGAGGC 

AGGAGGCAGGAA7CAAA77CG7GC77A7CAAA7AG7GC7CCAA77C777G 

AA7A77GGAC7A7AGAA7A7G7CA7GGA7C7A7GC7CAGG7GGG77CCC7 

A77AC7CAC7CCAC7GAGGCCAGG7TG7GGGA77AGC7G7CCAAGAGGGA 

G777CAG7C7CACAGCA7AGGG7CA77C7GAGAA77AC7GGCCCACAC77 

G7G7GGAGACC7CCAGAGAACAGAA7C7GGG77GG7GCCA7G7AC77CCA 

GGAGGAGAGAAG7GGCAGGA7GCCCAGCCCCACAA7CAGAGGGGAAGGGG 

CAGAGCCACA7G7A7GAAGA7CC7C7CCCCAG7ACG7GCCAA7CACAGGG 

C77CC7AGCT777GGGCCAAGGAAACAA7G7GGGAAGCAAAAAAGGACAA 

T777C7GC7CGC7T7GCA7GAAGAC7GAGCAG7777ACCAGA77CCCAGG 

GAAACACCC77CCACTC7GGG77GAA7G7GAG7GAGAGACA77CAGC7GG 

AACAC7AGAAAAAC7A777CC7GAGCGAC7CACC777AGCCC7AGAAAG7 

G77GGA777G7CCTTCA7CTT7GCCACAG7AGAGAC7GC7GA7AGCA7CA 

GAAC77GGGC7C7GGAAT7AGACAGA7A7GGG7ACAAA7CTGAGC7C7C7 

CAC77A77AG7G7GGGATGTAGAGCAAC7TTTAAAA7CC77CCAAACC7C 

AGAC77C7CA7GCATGA7G7GAGGA7TG7AA7AGGGCCCACC7AA7AGGG 

G77777GAGAA77AAAAAAG77A77CAA7GAACAGCAT77AGCAAGA7GC 

C7GACCA77GAGAAAA7AACAAA77G777A77A77A77G77A77A77AAA 

CA7C777CC7GCACCTTC7GAC7GGGGGCA7CG7A7CA7CAGAAA7AC77 

AGGA7GGGA7GGA77CC7GCA7GGGC7GAG7CAAGGG7GCAA7AA7GGAG 

GAG7GAAG AAGGAAGAAA7GGAGGCAGAAA7 C C C CAGGAG C C CAGCA7GG 

7ACAAGGC7GAGC7AG7GC7GCAGAGCC7CC77GGAACAGCCACAGAGC7 

7GCA7C7GGCCC7GGGAGGAACC7C77C7AGC7GGCAGGACCAGCCACAA 

CAG7GGCCAGGGGA777CCCAGGGCG7GGGC7CC7AGGAG77CA777GGA 

C CAAGC C7G C C7GGAGAGGGG77 A7 AACAGGGA7 C C77 C C C7AC7GGCAG 

G7GA777ACCC C7CGG7GAGAAGC7CAGGCA777G777GA7GGAAGG7GG 

AAGGCCC7G7GC7GGGCCAG7GAC7A7CAGGGA7GGGCGGG7GGC7GGAA 

AA7AGCAAA7AAGACAA7A7GA7AACACAG77AACCACCACAC7A7G7GA 

AGC7ACAA7A7GGG7A7C7G7AA7AGACAA77CCAA7G7AGAGAA7AA77 

C7AAGG7G7CA77C7CCCCGCCAA7GCCA7AAGCACACGGCC7C7GCC7G 

GG777CTCACTGTGGAATGTCC7CC7GGTCTCC7CA7GCCCAGAGAGTGG 
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GAAGTACTCCTACTTT,. XACCGGCTTTCCTGTCATCTCCCTGOSGCC V " 

CCTCAGCCCCCTCTGCACAGGGAGGTTTCCTCCCTGCTGCTGCAGTGCTT 

TGTACTTGTTAGTGGTACCTGCACACAGGTATTGGTGTCCTTGTCTCACC 

AC C CTACATCACTGTAAGCTCCC CAGGAGCAGGCTTCCTGTTTGACTCAC 

CTGTGATCCTCCACCTCCCACCCTGTAGTGCCTCAAGCATTGAGGACAAT 

CACTGGCTGCCCCTTAACCCAGAAATGCTGCCGAGACAGGAGGCCATGGC 

CCAAGTTCCTGGAATGGGGTATTACTATGTCAGCACAAAGGCCTTTGCAC 

AAATGAAGGCTTTAAAAATGCAGTCCTAGTCAGGTGGAGGAGGGCTTATA 

GGATTCCCAGGAATCTGGATCATTCTCTTGAGAGCTTTCCCTTGTCTCTG 

TTAAAACTCACATCCTACGGCCCAAATAACAACAAAAAATGGATGTAAAT 

TCTTGAAATAACTTGTGGATGGGGGAACAAGGCCCACCCCCCAGATCTGC 

CAGAAGCTTCAGGTGAGGGTCCCAAATGCCAAAAAGTCTGGTATCAGAGA 

GGATGGCCAGTGACCTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGA 

GCAGCAGCCTCGGCCCCGCACAGTGACCAGGACCCTGGCTTCCCACGCTG 

GGCAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCT 

GTCTCCTCGTGTCAGCTTACCTGGCTTTGCTGCGAAGAGGCCACTCGCAT 

TTCTCAATTTTTTATATTTTTTTAATTTTTTAAATTTTTTA^TTTATTTT 

TATTTTTATTTATTTATTTATTTTTAATTTTTTTTTAATTTTTTAAATTA 

TGCTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATAC 

GCATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGC 

ATTAGGTATATCTCCCAGTGCTATCCCTCCCCCCTCCCCCCACCCCACAA 

CAGTCCCCAGAATGTGATGTTCCCCTTCCTGTGTCCATGTGATCTCATTG 

AATTTCTTTAAAGGTGGAATCTCTCAGTGGGGTCTAATCTGTTCAGAAAT 

ATCAAAAGAGTATCCTTGGGAATGACTGGAATTCCAGAGTCATCTGGTAA 

TCCTCATAAAACAACTCCTGGATGTCTCTCAGCACATCTCCCACCTTGAA 

CGCAGGAGGCTGGTTCAAATGGAGGAGCATCGCTCTACTGCACTTTTTTT 

TTTTTTTGGCCTAAAGTGCAAAAGGGGATACGTTTCATGTAAATAAATCA 

ACTGCAAATCGCTAGTTATGCTGAGCCCTGTCCCGTGCTGTGGACACAAA 

GGAACCAAAGGCTTTTCTCCCCGCCCAACACACACATAACACACACACAA 

AAT CAT AAAAACATACAT AC C C C CAACACAT AACAACACA CAACACACAC 

ACAAAATATATACACACAACACACACCAAACATGCCCACAAACCTGTGTC 

CAGAGATAGATCCTACTGGTGGGTTTGTGGTCTCGCTGACTTCAAGAATG 

AAGCCGTGGACCTTCGCAGTGAGTGTTACAGCTCTTAAAGATGGCATGGA 

TCCAAAGAGTGAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAA 

GCTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGC 

CAGCTTTTACTTCCTTTTGGCCCCTCCCATGTTCTGTTTCCATCCTATCA 

GAGTGCCCTTTTTTCAATCCTCCCTGTGATTGGCTACTTTTAGAATCCTG 

CTGATTGGTGCATTTTACAGAGTGCTGATTGGTGCGTTTTACAATCCCCT 

TGTAAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAG 

ACAGAAAAGTTCCCCAAGTCCCCACTGGACCCAGGAAGTCCACGTGGCCT 

CACCTTTCAACTCCATAATGGCATGAAAATACATATGTTGTACAAAACAT 

ACATACACAAAGTATACATGCATCTCCCCAAATATACACATACCACAGAA 

ACATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGC 

CTCTGCAGTGAGTAGTTAGAAAAGTGAATTTGTTTTTCAATAAATTGGAG 

TCCTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAA 

AATATGTATGTCCTTTGGTCTAGCATTTACACATGTAGGAATTTATCCTA 

GTGGAG7AATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCAC 

AGAATTATGTATGCATATGTGTGTGTATATATATATATATCTCATACATA 

TAATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGT 

TAGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCTTATTC 

TCTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGG 

TACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGA 

GTTTATGAATACTCCATACTACACTAGGTAGCACCCCCTATTAAAGACAA 

ACTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTC 

TACAAGTCTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCC 

ATCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCT 

CATT C ATTC CAGAAGTGAGCAC CACAGAAGTGC CTACAGTTACC C CAACC 

ACCTTCTTAGAAGATAAGTTAGTGTTTGTTTTGACTTTTTAAAATTTTTA 

CTTCCTCTTTTCCTTCACAATCTCATCCCATCCCAAGAGGTTTATCAAGA 

AGTTCTCTAAAGATATGTGTCTCCTTATGGAATTTAACAGAAATCAGGGA 
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. i * w ■lAtiLwA'iv 4.GGG AAT AACATTTTT C CAGGT CTTTACoAC* 

A7AA7GGAA7ACC77GCAG7AA77AGA7ACAC7A77G7AGAAAA'GtA77G ' " 

ATGAAATGGAACGATGTTTGAGATATCATATTGAGTAGAAAAGGCAAGAT 

ACA77AAG7AGGAAA7G7A7C77ACAAAA7AA777G7CAGACACAC7CC7 

ATATTTGTATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAG 

ACCACAG7C77CGG7GAAG777AAGAGA7GA7GC7GCAGCA7GC7CAGAA 

AGGC77GGTATAGTTTTTTCCAGTAATTAAGGACTGATCTTAGGTAAATT 

GTCCA7CC7C7C7AAAC7GCACCACC7777G7C7G7AAAACAGGAAGGA7 

GGTATTTACCCCCAGGGTCATCAAAGGATTTGGTTGGAGAAAAATAAATA 

AATGGGCTGAGCCCAGACCTGGCACAGTGAGAGCACAGTGGTTGACTATT 

GTGCTGGCCTGTTGTTCCTGTGTTATTGACATGCTGCTGGTGGTGGTCCA 

GAAGCTATTACCTTAATTGGTTATGTGGATTTCCCCTCATACTGAGCAGC 

TGTGTGTGGTGTTGTAAAACATAGCCATACACAGTAACTGACAAGGGCAA 

ATGTGATGGAAAAATGCAAGGAAGTGCAGATAAATAGCTAATGGGCTGTA 

GAAGGAAGCTAGTCCTTGGAGGGCTTGATCAAGGAAGGTCCTTTTGCATG 

TCACCTTTGAAGAAGAGGGGACATAGAAGAGGTATAGTGCATCCCGGAGT 

GTACCTGGAAGGGAACATGAAAAGAGGACATTTTTCTCTGGSACATGGGG 

ACTCCACTTGCATGAACTCTGGAATTGGGGCAAAGAACCATCATGAGAAC 

AAGGGCTTCCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTGT 

CCCCTCTTTCCTTCACTCTCCTCTGTTTTCTATACC7GTATTATTGGACT 

GGACTGGAAGCCACCTGATCTATCACAAGTACCTTGAAATGTG7TGAATA 

GG7G7GGCACAG7CC77AGCAGAG7GGCAC7ACCCCCACAGGAA777G77 

7A7ACC777GGCA7GGAAAA7AGCAGGAAA7GAG7GA7CAC7GA7AAC7G 

AGGATGCTATTTATTATTGGCCAAAGGAATACTTGTGTTGTATTTGCATA 

ACCACTCACAAACTG7TGATTACAAATGAGTACCAGACCTAGCTCCTTCA 

AG7AAAGGA7C77GAGAAC7GAAGGCAAACAGAGC7CCAGGAG7CCAAGA 

CAGAGCCACAGACCACGAGGA7CCC7GGCCCAGG7AGG7GG7CC7CC7GC 

AC7GGC777CAAGGCCAACAGGA7GGA7GGGGAAG7AGAG7AGCA7C7GG 

CCA7C7AGACCC77GC77777A7CCCCAC7GGAAGCACA7C7GAA777C7 

AAA7A7GA7C7C7GAGACCTGCCCAGAACACC77GC7C7CAGCCCCAG7A 

GCAGCC7GC7C7C7CCCAGGAGGGC77CCAC7AACAAG7AGGGCA77GC7 

GGAGGGCCAGGCAGACAC7AGC77AGGAAA7CCACCAACCC7GGAAA7GC 

7AG7CCC77C7C7GAAGGC7CAGAAGAC7GAC777AGAG7C7AGAAAA7A 

77GG7CC77GGGAACAGA7777GAG7GCAAAGAGA7GGAC77CAGA7GGC 

CAGA7GCAC7GC77C777AGGGAA77C7G7GAAAGC7CCC7GCA777A7C 

77AA7ACAGGCAGCAGA777CA7GAG7ACCCCCGAGGGA7GGCCCCAGG7 

CC7CCAGCC7G7GAGCA7CC77C7G7CC77CAGCAGCACCACAG7A7C77 

7ATA7G7C777GGATACCTACGTTTCTGCCAGACATCTCTTGC7CTGATG 

77C7GGC7GCCAAA77C7C7G7CAAGCGCC7CCAA777777G7G7CC777 

GA777ACCCCAACA7GACAAAGGCAG77G7GC77CA7G7A77CAGGGA7A 

C7GCCAAACCACAAACAGG77AAAA7CAAA7AGCAGA7A7CCC7G77CC7 

AAAGACCCA7CAGC7C7ACCCACC7GC7CC7GC7CACCG7CC77A77G77 

GAGTCC7GAAGCCCTTCCTTGTCATTTTTATTTTTTGCATGAACAATTTA 

G77CCC777G7C7CAC7CC7AAACC777C7CAAAGGA77GGA777G7ACA 

CAAAC7GCC7A7C7C7GCAA7C77AGAAG7GA7A7GA77C7GAACAAA7C 

AC77AAC77T7GA7777TTA7TGG7AAGA7GGGAA7ACCAA77777GC7C 

CAC77C7G7CC7A7G77GGCC7GGGC7GA7G77GAAAGC7C7CGG7CAAC 

7GAGA7AGGG7G7GCAGAA777A7A7A7A7AAA7A7A7C7CC7CCAACCC 

C7CCCAA7GAAGCAAG7CACG7GAG7CAA7CC7ACCC7AAGA7A77AGGG 

A77GAGCC7CC7GGGACA777GG7GGC77AGG7777CA7GAAAAGAGG77 

GCAGAGCAAC7GC77777G77AGGCAAAGA77AGGC7AC7GCAGAGAC7C 

AGCAAAC77C7A7AGAAGG7G7CAGA7GG7AAG7A7777AGGC777GC77 

GCCAGA7GA7C7C7CAAC7AG77AACCA7GC7A77G7AGCC7CGAAGCAG 

CCAGAGACGA7C7G7AAACAAGAGCA7G7AG7G77GGCA7AAA7A7AG7A 

CCGCG 

>Concig43 

GCAA7AAG7C7A777AC7G7AAAG77AA7CAAA777ACA777CAGAACAC 
77AA7C7GCAAGAG7CC777CCAAGACCC7A7ACC7AA7777G7G777AC 
AA7777A7A777G7777C77AAAGAAGACCACCAA7A7AAAC7A7A7CCA 
GCC77CA7GA7AAG7ACA7AAGAAAC7A7GCAAA7AAGGGGGAAAAAAAA 
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CAAAGAAAAATACCTAC . rTACTAATGGTTCACTTCTGAATAGCACA^.^. 

TCATAATGATACAAGCACTCATTACTAGTCTAGGAAAATGAAGATATAAT 

TGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATGTGTGTGGTATAGAC 

TAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAGATAATTGTTAA 

TACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCGGTCATTAAGAA 

ACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGGAAGAGGGCAAG 

AATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATTTGACTGTTTAC 

ATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAATAATACTATTAT 

TTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCATTTTAGTAATT 

AAAAC T CATT C CAGGTTAGT C CACTCAAAACTT AT ATT C G AAAATT AAAA 

CTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGGAGTTCGAGACC 

AGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAG 

CTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAG 

GAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCACA 

CCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAA 

AAAAAAAAAAAAAATTAAAACCTCTGGAAGTTGAGTTTGCAGATATTCAT 

TATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATGATGAGAATTGA 

GGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCACAGCCCATTCAA 

TTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGGTCCATCCTTCA 

CTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCACCTCTAAGTTG 

GAGCCCAAGAAGAGGCACTGGGAAATGGAAAAGTCTTTTGTACGTGGTAC 

TTGCCGGGGAAGCTGCCATGAAGACCTGGCCCCACGGTGGGGAGGGAATG 

CCCAGCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGTCCAGACATGTC 

AGGTGGTCTGACAGGGCAAGCAGCAGGAAGTCATGTATGAGTATGAACTG 

ATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGGGGCGTGAGAAA 

ACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAGCCATGGGAGTC 

ACCAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGAGATGTGAGTGT 

AAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGCCCTCTTTCCCT 

ACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACATGGTTCCTCA 

GAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCCTGGTCAGCAG 

CAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGCAATGAGCTCT 

CAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCAAGACCGATTT 

TCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGGCTGAGACTAG 

CTGGCAGCTCCAAGTTCTTGGTCAGAAGAGAACAGGAACTAGGGAAATTG 

GAATTACTGTTACTACAATTCCTTTACATCCGCACAACCATGAGGTCCAG 

AGAGTCTCTCTTATTTTTTTTTTAAAGACAGGGTCTCACTCTGTCGCCCA 

GCCTAGAGTGCACTGGTGTGATCATGGTTCAGTACAGTCTTCACCTCCCA 

GGCTCAAGTGACCCTCCTGCCTCAGCCTCTCAAGTGGCTGGGACAGCAGT 

TGCATGCTACCAGGCCTGGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 

TCGGTAGAGACTGGGTCTCTCTGTATTGCCCAGGCTAGTCTCGAACTCCT 

GGGCTCAAGTGATCCTCTGGCCTCAGCCTCCCAAAGTGTTGGAATTACAG 

GCATGAGACACTGCACCCAGCCAGTATAGTCTTTTAACAGCTTTATTGAG 

GTACGGCTAACATTGAAAAAACTACACAAATGTAAAGTATGCAATTTGAT 

AATTTTGACAAATGTACACACCAGTGAAACTATCACTACAGTCAAAATAA 

TGAACATATCCATCACTCCCAATTTCCTCACGCCCCTTGGTAACCCCTCT 

CTCCCAACTCCCTGCCCCCTAACATCAGACAACTACTGATGCATTCTGTC 

TC CATAGGCTCATTTACATTTTCTAGAATTTTACATAAATAAAATGACAG 

AGTATATACTCCTTCATGTATGGCTTCTTTCAGCCCAATTATGTCAAGAT 

TCATGCTTATGGCTGTGCGTATCCTTAGCCCATCTCTTTGTCTTGCTGAG 

TAGGATACCATTGCATAGACAGACCACAGCTTGCTCATCCATTCACTCTT 

GACAACGTTGAATTGTCTCTGTTTTTTGCAATGACAAATAAGGTTGCTAT 

GTACATTCCTGTATAGACATTTGTAAAAGCACAGCATTTCATTTCTCTTG 

GGTAAAGACCTAAAAGTGGAAAGGCTGAGTCATATGGTAAATATATATGT 

CTAACTTTTTAAGAAACTGTCAAACTGTTACCCAAAGGGATTGTACAATT 

TTACATCCCCACCAGCAGTGTATGAAAATTCCCGTACTTCCACATCCTCA 

CCAATATATGGTGTGGTCAATCTTTTTAATTTTGGACATGNTAATGAGTG 

CAAAATGAGGCCCAGAGTGTCTGAAGTTACATTTGTATCCTTTTTGGCAT 

CCAAAACAGGTGTCAAGCATAGAAAAAACACTTGTTCCTTGAATGGTCAG 

TCATTTACAAGTGGAATTCATTACAAACCGGTAGTTCTACTGGGTTAAAC 

T ATG C C TT ACTGT CAACAGGCAC AT AC AC ATACAGACAGAC AGGAAGG CA 
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CAGAGACAAGGCAGAGO TGATAAGAAGGTGACCTGGGCTCTSGCTG 1 w 

3CCTATCACCTAGTAAAATATTAGTTAAGTAGCCATGAGTAACTCACTTA 

ACTTACCACAGGCTCCATTTTCTTATCTGTAAAATAGGAACATTGAAACA 

GCTAATC CCCAAGGTTTG.TGGATAATCAGAATTACAAAGATCAATGACAT 

TTCTATGAGAGAAACATATTTCCAAGTATTTGATGGAGTACATCAGACAC 

AAAGGAAAGGAAACTGAATA T T TT TGAGG lTTT rri'TTTTTACCAAGAAA 

TTCACATTTTGTTAAATTTTCAGAACTACCTCCTGAGGAAAGTGTAGCTG 

CACC CA7TTAGAATGATAGAAAACATCAATCTGTCTGATTCCAAAGCCAA 

GTTCTTGCTACAACGAGAAATGAAACAACTGGATCCCTACAGATGCAGAG 

ACCTGGGCCCCACAAATGTGAATTCTGTTCCCCTACCGAATAGAGTTACA 

GTT C C AT AATACAGTACTCC CTCACTTTTCCACAGTCTCACATTCCACAG 

TTTCAGTTACCCACAGTCAACTGCAATCCAAAAATATTAATGAAAAATTC 

CAAAAATAAACAATTCAGAAGTTTTAAATTGTGCTCCATTCTGAGTAGCG 

TGATAAAATCTTGTGCCACCATCCCACCTGTCCAGCTTATCGTTAGTCAT 

TGACAT CGTCTGCTC CTGACATC CAAC CATTGACAT CATCATGACT CTAT 

GATCCAGGATCACCGAAGCAGATGACCCTCCTTCTGACATATQATCAGGC 

CAATATCAGCCTAAACACTGCATCACTATGCCCACATCAGTCACCTCACT 

TCATCTCATCAAGGAGGCAATGGATCACCTCACATCATCACAAGAAGAAG 

AGTGGGTATAGAACAATAAGATAATTTTGGGGCAGGCATGGTGGCTCACG 

CTTGTAATCCCAATACTTTGGGAGGCCAAGGCAGGAGGATCCCTTGGGCC 

CAGGCATTCAAAACCAGCCTGGGAAACATAGTGAGACCTCCTCTCTCTGC 

AAAAAAAAATAAACAAAATTATCCAGATACAGTGGTGCATGCCTGTGGTC 

CCAGCTACTCAGGAGGCTAAAGTGGGAGGATCACTTGGTCCCAGGAGGTC 

3AGGCAGCAGTAAGCTGTGATCGTGCCACTGCACTCCAGCCTGGGCAATA 

AAGTG AGAC C CTGTCTCAAAAAAAAAAGGTAATTTTGAGAAAG AGAC CAC 

ATTCATACAACTTTTATTATAGTATATTGTTAGAATTGTTCTATTTCATT 

ACTTATTGTTGTTAATTTCTTTCTTTGCCTAATTTTTTTTTTTTTTTTTG 

AGTCGGAGTTTCACTCTTGTTGCCCAGGCTGTAGTGCAATGAGACGATCT 

CAGCTCACCGCAAATCCCGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCA 

GCCTCCCGAGTAGCTGGGATTACAGGCGCCTGCCACCATGCCCAGCTAAT 

TTTGTATTTTTAGTAGAGGCGGGGTTTCTCCATGTTGGTCAGGCTGGTCT 

CGAACTCCTGACCTCAGGTGAGGCCTCAGCCTCCTAAAGTGCTGGGATTA 

CAGGCTTGAGCCACTGCGCCTGGCCTCTTTGCCTAATTTATAAATTAAAC 

ATTGTCACAGGCATGTATTAATTTATAGGAAAATCATAGACATATAGAGT 

TGGGTACTATCCACAGTTTCAGGCATTCACTGAGGGGCTTGGAACACGCC 

CTCCTCAGATGAGGGGGGACTACTGTCATCTCCTCAATCATTCTTGATTC 

AATCCTCAACACAAATGGTTTGGCCAGGTCTTGCCTCTGGAGACAAAATT 

GCTAAGGATTTAGAGGGGAAAAAATGTAGTTCACTGGGAAAGTCACCTCT 

3CTCCACTGGACAGCAAC7TAAAACCCAGGCCATGACAAGTAGAAAGGCC 

AC C C C CACT CTCCTTCACAC CTGGAGTATTCAGGAGTCAATCAT ATTTCA 

GGACCACCAGGAGCAAACTGGGAAAAACTGAGCTGCCTTGAGGAAAGCAA 

TCAGCTCCACAAGGGGCTTAAGAAACAAGCTCTGGGAGGAGTGGTTGGAG 

AAGAGTTGGGGACACATCAGAAATGCCATCAAATTTCTAAGGGCTACCTC 

GTGGTGTCAGACCTGTGCATCTTCAAGGACATAAACAGATGGGATAAGCA 

GATGAGATTCACAGAGGACATCAAAATATTGGCTCCCCAGAAGGGAGAAC 

ATTCTAGTAACAGAGCTGCCCAGCTGCAGAGTGGACTGTTTCACAAAGCA 

ACAGGTGCCCTGCCTCTTGAATCACCATCTTCACAGGAATGCAGTAGAAG 

GGACTTAACTCCTGCCCTGAAGAAAAGGTTAGGCTAGGGAAACAGCTCCA 

AAATTTTTTAAAAGGAAGCAACATAGGCATCTACTGGGAGTTTTCTAAAG 

CCTTTGTTTAATGAAACTAAAGAGCTGGGACAGGAAATGCCAAATTAAAT 

TAATAGAGCCTTGCTTTAAGACAATGCAAGTGGATGGTAATGAAGGAATG 

AGTCTTAGGCCTTGGATCAACCGTATTAAGCAATGCTGAGCATGGAGCCA 

ATTCTGTTCACTAGATTTGCTCAGAAAGGGCCAGACGAGAAGGATTTTTC 

TAAAGGCACCTACTACCAAAAAGCTGCCAAGGCGTCCAATGGAGCCCAGA 

GAGAATATGCTAACAATAAAAAGTTGAACACCCTCAATAAAAAAGGGTAA 

AAGTAATTAATAGAAAATTACTGAAAGCTTTTTTGAAACCAAAAGTAGTC 

AGCATTGGTAAAAGTCTACAAAAGTGGACACTTTCATATAATGTTGGCAG 

GAGGGTAAAAAGACATAACCTTTTTGGAGGACAATTTGGCAACAGAGTAC 

CAAAAACCTTACAATTGAAGAGAACTTTGGCCTGAGTGCAGTGGCTCACA 

C CTGTAATGCCAACACTTTGGAAGGCCAAGGTGGGAGGATTGCTTGAGCC 
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CAAAAGTTTGAGACCAGc_vJTGGGGTAACACAGTAAGACCTCGTCTCTATu ; 

AAAAATAAGAAAAGTTAGCTGGGCATGGTGGCATGTGCCTGTGGTCCCAA 

CTACTTGAGAGACTGAGGCAGGAGGATCGCTTGAGCCTCGGAGGTCAAGG 

CTGCTGTGAGCCATGTTCATGCGACTGTTCTCCAGTCTGGGTGACAGAAT 

GAGACCCTGTCTCACCAGAAAAACAAGGCAAGAGAGAGAGAGAGAGAGAA 

GGAGAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGATGGAAGGAAGGAAA 

GAGAAGAAAGAAAGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAGAAAGAAAGAGAAAGAAAGGGAGAGAAAGAAGGA 

AGGAAGGAAAGAAAGAAAGCAAGCAAGCAGGAAAGGAAGGAAGGAAGGAA 

GGAAGGAAGGAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAA 

GAAAGAAAGAAAGAAAGAGAAAGAAAGAAAAAGGGAGAGGGAAAGGGAAA 

AGAAAAGGACAAAGAAAAGACCTTTGAACCCTGAATTTCACTTTTAGAGA 

TTCATCTTAAGGAAATTCATT CCAATAGAAATTTATC C C CAGGATTATCT 

AAATATTTGCTTTTATTTTCTTCTAGTAATTTTATGGTTTAACTTTCTCA 

TGTTTAAGCCTTTAATTTATTTGGAATTTATTTTGGTATGAGAAAGTGTG 

ACCTTTTTTTGTTTTACTTTAAAAAAAATGTATTACGATTATTATTTTAG 

AGACAGGGTCTTGCTCTGTCACCCAGGCTAGAGTGCAGTGGTGTGATCAT 

AGCTCACTGCAGCCTTGAACTCCTGGCCTCAAGCAATTCTCCCTCTTCAA 

CTTAGGAGTAGCTGGGACCACAGGCATGTACCACCATGCCCAACTAATTT 

TTTTTATTTTTTGTAGAGACAGAGTCTTGCTTGTTGCCCAGTCTTGCAAT 

GTTGTCTCAAACTCCTGGGCTCAAGTGATCCTGTCGCCCCAGCCXCCCAA 

AGCACTGGGATTACACGTGTGAGCCACTGCGCCCAGCTGCCTTTTTATTT 

TTTAATTTTTCAGATGCTTTGTTGGTTCCAAAATAGCACTTATTAACCCA 

CGCTTTCCCCCTCTGGTTTTAAATACTGCAAGTTTGGCTTTGAAATACAA 

CCCACTGCCTTATTCAGGCTACATTCAAGGAAATCTGAGACCAAGAGTCT 

GAAGGCCCAGTTTCCTTCCTCAAACCCAGGAGGTGGTAAATGTGTCACTT 

CCACACTTTCTATCTATTTCTAAGAACTCCTTCTTTCCAAACTCTGACAT 

GC C C CTGGCTCAGGTCTATAGAAATTCC CAGGGTC CACAGACAAAGCAGA 

ACTCACTTATGGGGAAATCTGGGAAATACTTATCTGTTAAACCTGCCCCA 

TATGGTGACTCAGATTGTCTAAAGCCCAAAGCATCATTTTCCACCCCAAA 

CCATTTCCTCCTCCAGACTTCTCTATTTCTGTGGTCCAGAGTCAAGATCT 

TGATATTACCCTAGAGTCCCCCTTCTGCTCTCCTGCATACCCAGATGCCC 

CTCCCTCCCCAGATCCATTCTCCCACCCTCCCTCCCATCAGTTTGGTGGG 

CCCATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTTGGAGCA 

GCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTGTTGTGT 

TCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGAAAACAC 

TCTAGTGGTTCCTCATTGCTCATTAATGAAAGTCTAGATATTAAACGTAG 

AAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATCTTTTCT 

CGTTACTCTGACAGACTCTCCGTCTGTCATTTATGTATTCTTTTATTGCT 

TTCAAGTATGGAATAAAGAATTTTATTTATTTATTTATTTATTTATTTGA 
GACTGGGTCTCACTCTGTTGCCCAGGCCAGAATGCAATGGTGCAGTCATA 
TCTCACTGTAACCTCGAATTCCTAGGCTCAAGCCATCCTCCTGCCTCAGC 
CTCCTAAGTAGCTATGACTACGGGTGTGCATCACCACATCTGGCTAATGG 
AATAAAATATTACAATGCCTAATCTTAATTTTCAAAATTTTAAATTACAT 
TGTACCTAATGCCCATGCATTTACTTTTTTCAGTGGGTCAATAGCCCTCA 
CTTTGGCAAAGGTCCCAGGCCCAAGGTAAGGCCTTACTTTTTCCAAACTC 
ATCTTTTGAAAGACATAAGTGCCTGTAAGTTGTACCACATTAGGTTCTAG 
GAATTTTTCATCAAAGACTTTATCAGACTATTTTCCTCTAAGTTGAGAAA 
GAGCTGGGGGCAGAATATGGCACTGAATGACTGAAGAGAAGGCACTGAAA 
TCAGGCCAGAGGTTGCTGGAAAGAGCAATGAGGAACACCAGCAGCAATGA 
GGAGCCGGTGATGATTTTGGCTTCACAGGGAGGTGTGTACCACACCGATT 
TTATCTCTACGTGGATGAACCACAGCTGTCGGCTCCCTTGTCTCCAGGAC 
ATCACACTCTCCACATTCCCTCCCATCTTCCGGCTTCTGCTTCCCGGGGC 
CCTCATCTGCCCCATCCTGGGTGAACACTGGTCGGTCAACTGCTGGGCGT 
ACCTTCCCGCTCTGCACACCCTCCCTGGCCACCCCACCCACTCTCACGGC 
TCGCACTGCAGAGGAGCCGCATCTCTAGCTCCAGCCCATCTGCCTCTTCT 
GAGCTCTAACTTCATGTAGGCGACTCCTGCCGGTGTTGCCTCACAGGCCC 
ATCATACTTCAAAGCATTTTCCCCTCAGAACACCATGTCCTGGCTGCTCC 
CTCCAGAAGA7ACATCTCTCAAGCACATCCCCGCGGCTCTCACCTGGATG 
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ACTGCATTCACCTTCTC JACATTTGCCCTCCTTTGGATGTATATAGA . 

GTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAAACACCTCA 

AAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACAGGTTGCCC 

ATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCCCTTGCCCC 

TCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTCTCTAACAC 

ACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTCCCATTGCC 

TGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCATCTGATTAA 

TOCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTCTCTCTGAC 

TTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAGCACTCTGT 

ATTACTCCTTTCTTAATGACCACCTGCTGTAGACTGAATGTTTGTCTTCC 

TCCAAAATTCATATGTTAAAACCTAGCCCCAAATGTGATAATATTTGGAG 

GAAGGCTCTTTGGGAGGCAGAGCGCTCATGAATGGGATTAGTAGCCTTAT 

AAAAGAGACCCCTGAGGGCTCCCTTGTCCCCTCCACCGTGTAAGGATGCA 

ACAAGAAAGTATGGTCTATGATCCAAAAAGCAGACCCTTGCCAGGTACCC 

AATATGCTGGCACTTGAACTTCCCAGCCTCCAGAACTGTGAGAAATAAAT 

TTCTATTTTTCATAAGCCACCGAGTCTATGGTATTTTGTTATAGGAGCAC 

AAACAGACTGATGTGCCACCCAACCATGATTATACGTGTAATTTATGGTT 

TCTCTGCTAGTAGGGATGCACCATGGGGTTAGGAACCACGCTTTTCTTAT 

TTCCCACACAGTCCTTAGCTCTAAGCATGTTCCTGAATCAAAGATCCCCA 

TCTTTTATGAATGAAGGAGTCAGTGAATGAATTAATGAAAGAACTGATAA 

C C CT C AAT AATT ATT C CAGC CTTTTATAC CTACTATT AACAAG CTTGCAT 

TCTACTCCAAATTTATTGGGCTTTAACTCTATTTTTGGCCAGCCACZATTT 

GACATTCCCTGAAGTAAATCTATGCTTTCCATCCTAAGTCAAGGAAGGAC 

CTGGACTAGTAGGGCCAAGAAAGGTCTAAATTCCATGGGTGGGAGAGAGA 

GACTAAATCTGAAAGGAAGAATAGATTGAGCAAAGGTGTAGAGATTGGGG 

AAGGCTGGACATTTGGAGAGAAGGAAAAGGAAACTGACACTAAACCAAAC 

AGTCTCACAAACACAATCTCATCCTTCCAAAACTCTGTGAAGTAAGAATT 

ACTATCCCAGGGCCAGGCACAGTGGCCCATGCCTGTAATCCCAGCACTTT 

GGGAGGCCAAGGTGGGTGGATCACCTGAAGTCAGGAGTTCAAGACCAACC 

TGATCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCTGG 

GCATGGTGGTGCACACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGAATCATTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCAGAGATCGTGC 

CACTGCACTCCAGCCTGGGTGACAGGGAGACTCCGTCTCAAAAAAAAAAA 

AACAAAAAAAAAACCAAAAAAAAAAGAAAAAACAAGAATTACTATCCCAG 

TTTTGCAGATGAGGCAATGGAAGCTCTAAAAAGTTAAGTAGGAGAAACAA 

ACATGAAATGTATGTCTTATGCTTTTCCTCATCCTATTTCCTCAGCCTGG 

AATGTCCATTCTCCCTCCACTATGCAAATCTAACTCTTCAAGCTAACACA 

TAGCAATGTCTGAGAAACCGTCCCTGTGTTCACTCTGTTAGCCTCACTTG 

CTCCCTCCCCATCCCTCTGTTTCCTTTCTGTTATAACACTTCTCTATTCT 

GCTGGCATCACAGTCATCTCCACCTGCCTTCCTCACAAGTTAAAAGCTTG 

TTAAGGGCAAGTGGTGTTCTTTGCCACCTCATTCCCCAGGGCTTCTAACA 

CAGTGCCTCATGCATGACAGAGTTGTAAAACAGGTTACCAAGCTGGCTTC 

AGGCAGGTTTGCATGGAACTGTGCTTTACAGGAATACCTGCTCCCCCCAG 

GCCCTGGGTCTTCCTCCTGAGTCCAGGCTCAGACTCTCTCATCCTGCTCG 

TTCTCTCTTGGGGAGCCACAGTAACTTTGAGCAACTTTGCATGGGATAGA 

ATGGCCTATTAGGGGCAGCACAAAGACCCCATGGAGGGAAGAGTACAGAA 

AGGGAAAACGATAATCAT ATITITI TAAGATGTGCAT I ^C T TAACAAAA 

TGCTCTAGTACTTGTCCAGACTTTCAAACTCAAAAAC CTAAGCGTC CTTT 

TCTTGAAGATCATCAAAGGCCCCAGTGGTCCTTCAGGTATGTCAAGCTTT 

CTAGAAAATAAAGGTAAGTCATAATCACTTAACACACATGGCTAAATGGC 

CATTTCCTTCTAATTTATCAGCAACTGTTACATATTTCTATACTAGAAAA 

AATTTATATTTATACTCAGGGTGGTAAGTTAAATTTGCCATCGAAGTAAA 

GCAGAAAGAGCGTAGCATGTATGTATATGTAACTCAACTGTGCATGAGAC 

AAAGATGTCTTGAGGAGAATGAGTCTAAGATGCGCCTGAGCAATAGTACC 

C 
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>Contigl 

GCACCCATGTTTCTAAAGGGCATACCAGCCATAATAACAGGATGGGTGAG 

GATATAGACAGCAGATGACAGAGAGGAGAGTGAAAGCTGGGAATCCCAGC 

TAAAGGCATCAGGTTTAT.GGAATGAGTAGGGGACAATACTGTGTGTGTTT 

ATACACACATGTATATGTGTGTATATGTATACATGTTTATGTATATATAT 

AATTAT ATGGTAC CATTTCTAATTGACAAAATAATCTATCACATTTTACA 

TTATCAGATTTTACATCTATTGTTCTAAATACACTCAGTCATCAGCCCTG 

TGTGTGGGCTCTTACCCATCCCCATGCACACCTCAGCTCAACCACTGATG 

GATGGATCATCTGCCTATCAGAGGTGGCATATTCAGGTGAATCCATGGCC 

ACAGCTGCAGCACTTCCTACCCACGCAGAAAGGCTCCACAAGAGGAGGCA 

CACCCGCTCTGACTGTCCCTAAGCTCCTGACATCTTCACCCCATGAAACT 

GCTGCTCCTGGGTGCTTCCTGCCTTGCCCTGCCCACCCTTGTACTGTTCT 

CACCATTGACACAGCTGGTGCCCGATGCAC 

>Contig2 

NAAAACGAATCGTCACTATTGAAGCCTGTCTCTCANCGGATCGTGACTAA 

GAACCCCCTCCTTGCTTCAAGTTGTCCTGCCTTTCTAGGCAGAGCCACCC 

TACATCTTAAATATATTGATTGATGACTTACGTCTCCCTAAAATATATAA 

AACCAAGCTGTGCTCTTACCAACTTGGGCACATGTGGTCAAGACCTCCTG 

ATGCTCTTGTCATGAGTGGGTGGGTGTTCTCAACCTTGGAAAAATAAACT 

TTCTAAATTAACTGAGACCTGGGTCAGATTTTTGGGGTTCACAGCAACAA 

TTTAAAAAACTCACCZATTGACCTGAAATTTTGACCTTATGCTGTTGCTCA 

CACTCCTCCATGAAAATAGACGCCATCCTATGAGTTCCCTCAGCCATGTC 

ATGCCACACTTCCAACATGTGTCCCCATCCACCATCTGTCTTCTTATTGC 

TGCATCCTACCCAGGCCCTGATCTCTGGACCCATTGTTGTATAATTAAGA 

ATTTGGGGCTGGGCATCGTGGCTGTGGCTCACTCCTGTGATCTCAACATT 

TTGGGAAGGTGTATTAGTCAGGATTCCTCCGAAGGATGCAACCCTAGGGA 

TCCTCTCTATGACCCTATGTCTA 

>Contig3 

CGCGCTCAACCGACCGATTTGCGCGAACCTGCCCATGCCCGAGGACAGTG 

TAATC CTAAAACGTC C C CTGAAT CATAAGGAT ATGAGTGC GAAAGTACGG 

TTCCCTCTGTCACCACTTTCTAACAACGCTATGTCCGATCCGTGCACTAA 

CCCCGCCCAAGTCACTGAAACACTGATGGGCGCTTCCTCTACAGGTATCC 

AGGGCCAATACCACTACTCCCCTCCTCCCTGTCCCCCTTCCACTCTCTAG 

AGGCCGCGGATGCCATCCTCTATTAGCACAACCGAAAACGACGGTGAAAG 

TACCACGAAGCTCACGATCTGATCGGTCGCCCAATGCGGTTACAACGGCT 

GTCATCCCAACCCCCGTCCCATCCTCCATATTGCCCCCCCCTATGAGGAT 

GGCCCTATCATCATGACCTCCAAAATTCTGTCATCTCCCGACGTAATGCC 

GCCCCTCGAACGCCTGACACCATCAAGTCNGTCACCTCCCAAAATACTCC 

TCCTAATCACCAGGCCGAGTATCCCCGGTTCCACAATACCTCCTTGAGAC 

GGGCCGATATCACACAC 

>Concig4 

NGGAGTTTAGGTCAACTAGTAACAAGTGGGATTTGCGACTCAGGTCTATC 

TAATCCTCAAACCCACGTCCTGGACCCCTACACAGACTGCCCTCCCTCAG 

TCCTCTGTGTGGCCTCAAGAAGGGTCTGGACATTCAAGTTTAAAAATCCA 

TCCAAAGAATCTATGGACCCAGTGGTCTCTGGAGTCAATGTTCTGAGGCT 

CAGAAGGGCCAGGCAGGAGGGAGCCGCCTCTACACAGTCCTGAGCAGAGT 

GGGCTGTGTCCCGGCACAGCAGGGGAGATCATAAACAGAATTCTGCCCTG 

GGCCCTATTTAAGTAGGACCTTTAGGCTGCCGGTGTCATGACCACAGGTC 

CCANGTCTGCACGATTGGCTGTGTGTGGAAAATCTTCACTCCTTGCGGCC 

TTGTCCTTGGCAGAGAGCACCGCTGCTTCCTCTGATGGCCACCAGGGGGA 

GGCGCTCCCCTGGGAACGGTTTGAANGGGAGCCTCACCCCACACGTGCCT 

TCCGTGGTACCCAGCACCAGCTGCTACCCATGGTTACCCACAGGCCCAGC 

TCTGCTCTGAAGAAGGAGGAGTGGTGGCGATCANGCCTTGTCTGCATCCC 

GTGGCTGCCCCTTTCTTTTCTTT 

>Contig5 

GGGAGCTAACCGCTCACTGGGATTACAGGTACGCACCACCACGCCTGGCT 
AATTTTGTATTTTTAGTAGAGACGGGGTTTCTCCGTGTTGGTAAGGCTGG 
TCTCGAACTCCCAACCTCAGTTGATCTGCCCGCCTCAGCCTCCCAAAGTG 
CTGGGATAACAGGTGTGAGCTACCATGCCTGGGCTTATATGTTTCTAGTC 
CAAACATTTAGC7ACCTTTTTTTTTTTTTTGAGACGAAGTCTCACTCTGT 

FIG- 4 (1 of 61) 



ENSDOCID: <WO 9906426A1_I_> 



WO 99/06426 



PCT/US98/16102 



TGCCCAAGCTGGAGCACAGTGGCACAATCGTGGCTCGCTGCAGCCTCAAC 

CTCCTCAGGCTCAGGTGATTCTCCCACCTCGGCCTCCCTAGTAGCTGGGA 

CTACAGGTACGCACCACTACACCCTGCTAATTTTTTTGTTTTTGTATTTT 

TTGTACAGATGGGGTTTCTTCATGTTACCCANGCTGGTCTTGAACTCCTG 

GGCTCAAGCAATCTGCCTACTTCAGCCTCCCAAAGTGCTAGGATTACAAG 

CATAAGCCACCATACCCGGCCTACCTACTTTTAACTTGTGGAATTTTCTA 

TAAGGTCANGGATGCCTGNGGGAACAAAAGTTTCTCCCTTGGTATATGCA 

AGTAAAATCCACATGCTGCCTCCC 

>Concig6 

AGGACTGTAGCTGTTGTCTAGTCACCAGGCTGGACTGCTTGGCATGATCT 

CAGCTCACTACAACCTCCACCTCCTGGGTTCAAGGGATTCTCCTGCTTCA 

GCCTTCCAAGTAGCTGGGATTACAGGCATGCACTACCATGCCCGGCTAAT 

TTTGTATTCTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGCTCT 

CAAACTCCTGCCCTCAAGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTG 

GGATTACAGGCGTGAGCCCCCGGCCCACATGTAAAAGTTTATATCTCTGT 

TGTTTCACCTTGTTTTTGACCTAGTCTTTCAGTGATTTGAATCTTGATTC 

AGTCTTTTGTTATTTTAGTGGTACTTCCCAGCTTTGTGTCATCTGTGGAT 

GACATATGAGTCTTGCTTCTTCATGCCAATTTAAGAAGACTGAACGGGAA 

TAGGTCAAAGGCATGGCCATGAGCGATTTCTCTCCAGCTTTTCATGGTGT 

TCAGCTTCAAATCTATTCACATATTGGACCTGCAAGCCATCATCTTATCC 

ACAGGCTATCATCATAGGTGAATGTAAATTGGGTTTAGGTGGCCAAGCTG 

AACGTGAGATATNTTC 

>Concia7 

AGCATGTTCTCTAAAGGCCTATCAAAGCTGACATCAAAGGGATAAGTTCC 

AGTTACCCAGCTGAAGGGAAGGAGGGTGTTTCAGATAGAGGAAGGATAAG 

CATGACCTATTCAAGGCCAGTGAAAGAAGCGTGCAACGGCCAAGTCAGGA 

GAACCTGAAATTGTGTCAAAGAGCTTGGATGCAAAGAGCCGTGGGAGACT 

ATTGGGGGTTTTAAGCAGGGATATAATATTCATTCAAGCATGCAGTAAAA 

GGTCACTGGCACCTGCCATGGGCCAGGACTCGGGCTCTACATGATTGCGT 

CTGTTTTGGAAATATCACCCTGGCTGTGAGATGAAGAACAGGTAGGAGGG 

TCACAAAACTTGAAGCAGAGAGACTGTTGAGGAAGTAAGCTGTTTTTGTG 

TGGACTGTGGCAATCACAGAGGCAGAGGATATAAATGCACAGAGACACAA 

GGCATGTGGGAGGCAGAAGGAATCAAATACAATGAGTGATCAGATGTGGG 

GTTAGAATGGTGAGTGANAAAGACATACTCAAGGTGACACGCCCAGGTAT 

CTGGGTGGATGGTAAGACATTCATGGACTAGAATCGAAGAGGAGGTGGGG 

ATGGACATTCCTTCCGTTTAGAGGGGTTCACCAGGAGGATTTGCCGGAAC 

ATGGAGAGGATTAACCAGGAATCCGGTGCCTTTTTCCAAACTGGGTTGGA 

GGGG 

>Concia8 

GGTGAATGCTTTGGCACGCTGTGTAGATTTTAGGTGACGGGTGGTGACAA 

TGAGTCCGTGTCGAGCGCTGATTTTTTCGGCCTTTAGAGCGAGATTTATA 

CAATAGAATTTGGCATGAGATTGGATTGCTTTTAGTCAGCCTCTTATAGC 

CTAAAGTCTTTGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGT 

TTCCAGTTTTCCGCTCCTAGGTCTGCATATTGTACTTTTCCTCTTACTCG 

ACTTAACCAGTACCAACCCAGCTTCTCAACGGATTTATACCATGGCACTT 

TAAAGCCAGCATCACTGAGAATGAGCGGTGTGGTGTTACTCGGTAGAATG 

CTCGCAAGGTCGGCTAAAATTGGTCATGAGCTTTCTTTGAACATTGCTCT 

GAAAACGGGAACGCTTTCTCATAAAGAGTAACAGAACGACCGTGTAGTGC 

GAATGAAGCTCGCCATACCATAAGTCGTTTTTGCTCCCGAATATCAGACC 

AGTCAACAAGTGTCAATGGGCTCGTATTGCCCGAACAGATTAAGCTAGCA 

TGCCAACGGGATAAACGAGTCGCTCTTGGTGGAGGG 

>Contig9 

GGGGTGGGGCGCCTGGTGTTTCTAAAGAGGATCTCCTGCCAGAAATGGTG 
TGCTGACACTGTTGTCCTCCTTGGTGTGGAACTTTGGTGGGAAGAAAGGT 
TGGAAAGGGAAATTTTGATCCTTGGATTTAACCCGAGTTTGTTACTGATG 
CTCACAAGACTAGGGGAAGGATAAAGGCAGGTGAGTCACTCTAGGATGGC 
TCANTGAGCTCCACAGAGCTGGAACCACAGGCACCAGGAGGGATTCAGAG 
CAGGCCTCAGTGCACGTCAGCTGAGTGAACCAATGAGCAGGTGATGGGTC 
CAGGCAGAGCCCTGTCCTCTTTAGGCAAAAACCCTTGAAACACCGTTCCC 
ATCCTAGCCTGTGTTCCACCCAAAGCTGGCCAGTCTCCAGGCCCTGCCTG 
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AGCCCCAAGGAAGTGGTATGGTGAAACAGAAGGGCCATTCCTGTCCAATG 

TGTGAGGAACTTCATTTCAGACTTGTTGGAAGCCCTGATGTTCAAAAACC 

TCAATGATATCATTCATTTTCCCCATCCATTCAATGCCCATCCAATGCCC 

ATCCGTTCAATGCCCCTTCCATTCCTCTTCAGGGAAATGAAAATTGTTCA 

GAAATCCTTTCTCTTTCGAGAAACCAACCAAACCAAAACCGCGAAATTCA 

CTAAACTAGCCAAGACACAATCCTGGGTTATTTTCCTTTTCCCAAACCTC 

CTeTGTTTAAATTAATTCTACCCTGGTTCTCGGCCCTTACTGCGAAGGTG 

AACTCACCTAACCTCTCCCAAACAGAGAAGAAACTTCTCTTGGTAAAATG 

GGTTTTAACACTTCTAAAAAACCCCC 

>ConciglO 

GCTATGGTTCTAAAGGTAATGGACTATGGCGTACACAACGTCTCGCTCAT 

CGTCTGCCAGGAGGCTAAGGTATCCACGGACAATCGCTGAGCAACAGTGT 

CGTTGAT C CATCTCTGTACGCACTTGTCAACATGGCAGGAGTACGGGAGC 

TGCGAGAATCCTCTCTGCTGATGTCCCACGGAGCATGCCGTGAGACAACG 

C CAC GAAC GG C C CTCGGAGANANCTACTCTGCAATGAAGACGTAC GAT AC 

ACACGTAGGAGTCCTAGCTCACCAGCCGTATCTAGGTATACTGTACTCGC 

GGATACTCACTCGTGCATGCGGCAATAGATCGATACGCAGTCGTCACGCC 

CATGCTCTCAGTGTGTGACCTTCTGGCGGTAGCGTNGTGGGCGCTATTAC 

TGTGCGCAGCAGGCGCNTCGTACATGTGTCGGGTAGCGATGCCAGGAGCT 

GTAACATAGCAAGTCGCCCCCCTACTCCTATCACTATCCCTACGCTGGAC 

CGCACTCGAGATCTGAACGCACGTCTTAACCTGCCAGTACTCGTGAGACC 

TATACTGCGCAAGCCTTGGCTAGGAGATCCTGCAGCGCCGGCAAAGAATC 

AGCTATGATCCCCTTGCGATTATCGCACACGCACCATAGAGTATGTGCAT 

ATTAACCTCTGAATGTGCTGCAAGCAGACGGTTGCTCAACATATATATGG 

ATGTGGGGAAATCGCCCTGGTCACCGCCACTTGGCGTCAGGAGGCACCAG 

CACGTCTGAGTGTCACGCACGTTACTC 

>Concigll 

GGCCGAATGGTGAATTCATCCGTCGTCTCGAGGGGGTGAAAGACGGGGAG 

TTATGCTGTAATGGCACCGCTCACCCTGGGCTTATGAGCAGACCTAACCC 

TCCCANAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCCAGTAT 

CTGAACTTCTGTGGCCAGGCAGAAAAGGTCCTGTGTTACTCGTCTCCTTT 

ATCATTCATGTCCATATTCTCCCATTTGCTAACATTTATGTTTCTGCTCC 

ACTGGATTCTTTGGATTTTTCTAGAACATACCCATGCTTTGCATTGCCTT 

GGTCTTTGAATATTTGGTCCACTTTTCCTGCAAAGTCCCCTCTCACCTTA 

TCTTCCTGGTAAACTTCCAGCCAACACCTCTTTACTAACCAGAGAAACAT 

GGTTCAACTGTGCACAGGCTTGCACAGAAACTGTTCTCATATTGTCTTGT 

CATTGTCAATGTGGCAGAGATGCACCTTAGATACCTCTTTGAGAAAGGAC 

TCACTGCCCAGCTGCCTGGCACGTGATGAGCTGATAGCTCCAGCTATAGA 

CTCCTTTAGGGTCAACCTCTGCTTTCCAGTTGAGATCATATCCTTTGCAG 

GGTGGCCTCCCCAGTGATGACTAAGGCAGTGTTACAATGGCCTAGTCATT 

TCCTCCCAATGCTGGACTCCCAATGAACCATCTGCTCCGGAGCTTCCCAC 

TGGGCAGTCAGAGACCTTAGCTAGTCTGCCTCCGAATCAGAAGGCTCTCT 

CTTGCCACTCTGGCC 

>Conrial2 

GCTGTGTCTAAAGATTCACGGCTGTAGTTCCAACTCCCGCCGCCCTCTAC 

TGTGTCCTCTTAATGGCAGTCATTCACCATCTTCCTGTCCCTCCCCTTCA 

TTTCTTGGATGGTGACTGTCACTTTGCTGCAACAGAACCCTGTCCCAATC 

CTTGATGGTTCAATACACACATAGACATTCTTTTTAACAGGGCGGCCTCT 

CAGGTCTTTAATTTTCTTCCCTCCAATAACCTTGTGATGATCCCCCAGCT 

TAGCCACTTACTGCCAGATCATTACCAGTAACTCCAGCCCCTCCTTAATT 

CTAGTTTCTAATATCCTAATCTGTGACCTCACATTCCAACTTCTTCATTC 

TTATCCCCTGAGTCAAAAAATCCTTTGATCCATGCAATCCATTAAGTCAT 

CTACCTTTTCACCATTCTTCGCCCCACTAGGGTTCTCATTCCTTTATTAC 

CCATATGAAATTCCAAGGCCTGTTGGAATCACTCCCTTGCAGCCACTGTC 

AATACTTCTGCCCCTTTTACTTCATCACCCTTATGTGGCAAAACCACAGC 

CCTGGTGGAGTCGATCCTTACCCCTGCTCTGTGCCAACAGCCGCACACGC 

ATGGCTGATGGAGGTTGGAAAAATCCACACATGCAGTGGGCCCTGTATGT 

CCATATACGTATCCAACCTCCAGCCTTGCATATGCCTCAGTGCTGCCTGA 

CAACACATTATATGTTTTCCTTAGTTCCTTCAGTCTCCTGGGTGCCTAGG 

TGAGTATCTCAGACATCCTTCTCTCTCTGCAAAGCTCCAACACCTCCACG 
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TCACATTCAACTGATGACTGTGTCTCCTATGTCACTTAGATCACAdAGGC 

ATACATAAACAAATCCCAGCCACTGCCAGCACTCTGCACATCTGCGAGCA 

TGGCACCCCCAATCTAGGCCTTTCCTGCTGTCACTTGGGGTGAGCTGATT 

ATACTCGATCCTAGTCATTTCTACTTATGCAC 

>Concigl3 

ZTTAAGGCCTCCCTCTAACATTTTAATTTAAGATTGAAAAAGCAAAGATT 

ATXCTGTTTTGGCTGCGCCTATAGTAAAGTAACCCCTATGNCAAATTTTG 

ACACCTTATAGTATTTGACAGGGATAAGTATAAAATTGCTTGATTGATAC 

ATCCACACCCAAATGTATGCTGGGAATGATTTTGTTTCACGGCACTCATT 

ACTTAATTTTTAAAACTCTTATTTAAATTTGCAATGTTTTAAATGACCAT 

CACTTAAAGTAGTAATCAACAGAGGTTAGGAGAACATAACAATACTCTTT 

CTCTTAGAAAATACAACAGAAATATAATTTTTTACAGTTTTGCTCCCAAA 

CTTTTCTCTGTAATAACATGCCTTACTCACCTTTACAATAGGTTTGTTGT 

GAGAATCTTGTAATGTAAACCCTGGGTGTTCTGTGAAGCATTTTTAAACT 

TCTAGTTTACACTGACTCTTATTCAAGTGTTTTTAAAAATATATTTAAAA 

AACTGGCCAGGTGCAGTGGCTCACACCTGTAATCCCAGCACTITGGGAGG 

C CAAGGCGGGCAGATCACAAGGT CAGGAGTTTGAGAC CAGCCTAGC C AAC 

ATAGTAAAACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGT 

GGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGAAGAATCG 

CTTGAACCCGGGAGGCAGAGGTTGTGGTGAACCAAGTTTGCGCCAATGCA 

CTGCCAGCCTCTGCAGNGACAGCC 

>ConniQl4 

GGGGGCGGGCCGAGTGATCCTAAAGCCCGCTCGCTTCACAACAAAGCCTA 

ACAGTCCAATCACTTAATGCTGCATTTATTCCTGGGGAAGCAAGTCTCCT 

TTGCACTTTACACAGTGAGATAATCAGTTTCTCATGTGGACCACTGGGCC 

AGGAGGGCCTGACAAAGGGCAGTCTACATTTCAGACTGGAAACTGCTCCC 

AGAACTATTTCTTTCTAGTTCCCACCTCGGTCTGAGGTGCCTGAGGAGAG 

GGACTCAACAGAGGAAGCAGGAGCATAGCTCAAAGTGTCAGAACATGGAA 

GAGGAAAAGAATCCTCACAAGATTACGTAACTTACAGGCGTGTTGCTGCT 

TCAGTAGAAGTTTCATCTCCCTCAATCCTGTACACTTTTCCATACATTAC 

ATACTC AAACTGGTCAGC C CTATGGAGC AATAGCAGCAAAGTTATTCTTA 

ACAGTAATTAACAATATAAAAGATCCCATTTAAAAATGGTTACTGGTCAG 

CCGGGCGTGGTNNNTCNANCCTNTAACCCCANCACTTTGGAAAGCATGCG 

GGCGATCCCAAGTCTGATATCGAAACATCTGCCTAACATGTGCAACCCCT 

CTCTACAAAATACAAAAAATATCCGGGCTTGTGTTGGCGCCGTTATCTCA 

CTACCCGGAGCTAAGTAAGAAATGCTTTACCTGGAAGCGATTTTTTTACT 

TATATCCCCTCTCTTCACCGGGCGCGACCAAATTCTTTAGTATAGGAAAG 

TTTATTGTTTTATGCCTTTGTCAAGGCTCTACTGTATCTTTTCTGTCCAC 

TCAC 

>Concigl5 

GGTTCTGAACAACAGCAGGCGATTCCTAGCCCTGTACCCGGGGCATTGTC 

CAACACTCGACAGGGCTGAATTCGTCCATAACGGTGTGCCCCTCTGGGAT 

ATAGGATGAAATGAATTGATCTGAGTACCTGGGATGTAAAGTTACTAAAA 

CGCCAGCTAGGTTCACGCCCCGATGCTTAAATATGATCGTGGCCTACACC 

TCGTCCAGCAGAAAAAGTACCCTTTCTTCAACACCACCTCACGATCCTCC 

AATTTAGGAGCTATAAAACTCATGACTCTTTATTTACCCCCTGCAGATTC 

TCAATCCAATAGTGTGTGTCTCCCTGTGAACTCACGGATATACCGATTTT 

CCCCACGTCATTTCCACACGTCGCAATCGCTTAGTCATCCCTATGTATGA 

GAATCATGGATGACTATGTTGAAGTCCATCTATAAAGTTCAACCCCCATC 

TCCGTCCCTGATTCCCCCTCCCCAAGATCACCAACGCGACTCGACATATT 

GTTATCGCCCAAGGGACCTCTTGCATCCCCCATATCCACTGGTCACCTCC 

CCTCTTGGCTGGAAGTCACCGGGAAGTTCTCCACATGTTGT 

>Contigl6 

TGCGAGCGATGTTCCTAAACTTTAGCGCCATTGACTCGAGCATGGTCATG 

GCTGTTTCCTG 

>Contigl7 

AGGGTGTTCCTAAAGGATACTACGTTCCCTAAAGTCCAGAGAAAAAAAAA 
AAAGTAACATAATGTGGCTTATTTGGTATAAAAATTTTACAGGAAGCATT 
GTCAAATATGAAATAGTGTTTGGTTTTGTTTGGGCTGTATTTGTATAAAT 
ATGTTATTGGTATGTGTTCCAAAATTA7AGGAAACTCCTATAATTCTGAT 
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ATGACTTGGTGTACATTATCAGTAATAATTATAATTGTTATGGTAAATTA 

TTGTGTGCCATGGAGGTAACAAATTTCCTCATCAAGTGTGTCTTTGACTA 

TGGTTGCCCTAAAACTTTTTGCCATTCACAGACAATTGTCTTGCTTTGGT 

CCTCTTTAGAAGGTGGTTTTATAATCAGCTATAAAACTCTAACGGGTGCT 

CTTGAATGCAGGCTTAAGATAGCTTTGGAGACTGTGACATCAGAATAGAG 

GAAAAACTTTCAGTATTCATGGAGTGCTGAAATATTCATGAATATCAAGC 

AAAACAGGAATTAACTTCATAGATGGAACTAAAAGAATGCTGAAGTAATC 

TTTTTGACTTTTTTTCTTAAAATGTTGATCCTTCGTTTTGTTTTTCAGAG 

TCAAGGAAATTTTTCTGTTGAGATATTGACAGCTTTTAACAATTAAGTAT 

ACTCCAGTGAACACAATTTGGAGCATATTTGTGTCTCTCTATATATATTT 

GGAAACAATNTTTGAGTATTCTTAACTTATTGCAATATT 

>Concigl8 

GGTTGTCTGCTATACCAGTAATGGGATTGCTGGGTCAAATGGTATTTCTG 
GTTCCAGATCCTTGAGGAATTGCCACACTGTCTTCCACAATGGTTGAACT 
AACTGACACTCCCACCAACAGTGTAAAAGCATTCCTATTTCTCCACATCC 
TCTCCAGCATCTGTTGTTTCCTGACTTTTTAATAATCGCCATTCTAACTG 
GCATGAGATGGTATCTCATTGTGGTTTCAATTTGCATTTCTCTAATGACC 
AGTGATGATGAGCTTTTTTTCATGTTTGTTGGCCACATAAATGTCTTCTT 
CTGAGATGTGTCTGTTCATATCTTTTGCCCACTTTTTGATGGGTTTTTTT 
TTCTTGCAAATTTGTTTAAATTCCTTGTAGATTCTGGATATTAGCCCTTT 
GTCAGATGGATAGATTGAAAAAATTTTCTCCTATTCTGTAGGTTGCCTGT 
TCACTCTGACAATAGTTTCTTTTGCTGTGCAGAAGCTTTTCAGTTTAATT 
AGATCCCATTTGTCAATTGGCTTTTGTTGCAATTGCTTTTGGTGTTCTAA 
TCATGAAGTCTTTGCTCATGCCTATGTCCTGAATGGTATTGCCTAGGTTT 
TCTTCTATGGTTTTTATGGTTTTAGGTCTTATGTTTAAATCCTTCTTTTT 

GAGCGAGTGGCGTGTCTNTAGGACGC 
>Contigl9 

GCATGTTGTCTAAAGGTTTGTCTTCCTCCAAAATTCATATGTTAAAACCT 

AGCCCCAAATGTGATAATATTTGGAGGAAGGCTCTTTGGGAGGCAGAGCC 

CTCATGAATGGGATTAGTAGCCTTATAAAAGAGACCCCTGAGGGCTCCCT 

TGTCCCCTCCACCGTGTAAGGATGCAACAAGAAAGTATGGTCTATGATCC 

AAAAAGCAGACCCTTGCCAGGTACCCAATATGCTGGCACTTGAACTTCCC 

AGCCTCCAGAACTGTGAGAAATAAATTTCTATTTTTCATAAGCCACCGAG 

TCTATGGTATTTTGTTATAGGAGCACAAACAGACTGATGTGCCACCCAAC 

CATGATTATACGTGTAATTTATGGTTTCTCTGCTAGTAGGGATGCACCAT 

GGGGTTAGGAACCACGCTTTTCTTATTTCCCACACAGTCCTTAGCTCTAA 

GCATGTTCCTGAATCAAAGATCCCCATCTTTTATGAATGAAGGAGTCAGT 

GAATGAATTAATGAAAGAACTGATAACCCTCAATAATTATTCCAGCCTTT 

TATACCTACTATTAA 

>Contia2 0 

ACGGTTCTCTAAAGACTTTCAAGAGCTGGATTTTATGCTTTAGGTGAAGG 
TGATAAAGTAAAGTGCTTTCACTGTGGAGGGGGGCTAACTGATTGGAAGC 
CCAGCGAAGACC CTTGGGAACAACATGATAAATGGCATCCAGGGTGTAAA 
TATCTGTTAGAACAGAAGACACGAAAATATATAAACAATATTCATTTATC 
CCATTCACTTGAGGAGTGTCTGGTAAGAACTGCTGAAAAAACGCCATCAC 
TAACTAGAAAAATTGATACCATCTTCCATAATCCTATGGTACAAGAAGCT 
ATATGAATGGGGTTCAGTTTCAAAGACATTAAGAAAATAATGGAGGAAAA 
AATTCAGACATCTGGGAGCAACTGTAAATCACTTGAGGTTCTGATTGCAG 
AT C CAGTG AAGG CT CAGAAAGACAGT ACACAAGAC GAAT C AAGT CAG ACT 
TCATTGCAGAAAGAGATTAGTACTGAAGAGCAGCTAAGACACCTGCAAGA 
GGAGAAGCTTTGCAAAATCTGTATGGATAGAAATATTGCTGTCGTTTTTA 
TTCCTTGTGGACATCCAGTCACTCGTAAACAATGTGCTGAAGTGGTTGAC 
AAATGTCTCAAGTGGTACGCAGTCATTACTTTCAAGCAAAAAAATTTTAT 
GTCTTAATCTAACGCTATAGTAGGCATATTATGTTCGTATTATCCTGATT 
GAATGTGTGATGTGAACTGACTTTAAGTAATCAGGATTGAATTCCATTAG 
CATTTGGTACCAAGTAGGAAAAAAAAATGTAAAGCCAGTGCTTAGACACA 

• r*> 

>Concia21 

CGCTGTCTTAAGAACTGGGCTAGGAGTGAGCAGTGAGCCAAGATCGCACC 
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ATTGCACTTCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAAAAATA 

CATATATATATATGACCCATAAAAAGGAGATAAATCAACACTTCAGAACT 

GACCCAAACTTGCAAAGATACTATAATTAACAGAAAAGGACAGTTTACTA 

AGTAC~CCGTATGTTCAACAAGTGAAAGATTAAACATATTAAGTAGAGAT 

GTAGAAGATATAAGAAGATCCAAAATGAACTTTTAGAGTTGAAAACTACA 

ATATTTAAGATAAAAATACACTAGGTGGGATTAAAAGTAGATTACACATT 

GGATAAGATAAAAAAAAATGAGCCTGAATACAGCACAGTATAAACTATCT 

TAAACAAAAACACAGAGAGAAAAAATAACTTTAGAGACTTAGCTCTTATC 

CTCTATTTGTTTCTAAACAGAGGATAAGGGGCAGAAAAAATGTTTGAAGA 

AATCATGATTTTTAAATTTCCAACTGAGATAGGAATAGCACTGGGTAGTC 

ACAGGAGGCTGGAAAGACCCAAACAGCAGTTAAAACAGGAACTAGGCAAA 

GAAACCAAAGGATAACAGTAAACCTAAACTAAGGGAGAGAAAACTGACAA 

AAGCTGACTTAGGATAACTGAC 

>Concig22 

C CTGAAT ATAAGC CGCAAGTAAC CAATTAAATTTGTTTTC CAAAATTGTA 

TTAACAATCTATGAAATTTTTATCTTGACCATAGCTATAACTTCCAGAAG 

CCTTTTATAACCTCTATAACCTTTATTAAGGAGTAGGTTAATGCTTCAAG 

AAAACCTTGTTAATCTGACACAGGACCCATATGCTGATCTTGCATCAGTG 

TGGCTTGGACATCAATGATTATGATTAATTTATAGAGAAATTGAACTTAT 

TTTATCTCTCAAAATTGGCCCTTACAATCTCACACACCCACCTCTTCCAC 

TATAGTTCCTGGGCCTTGAGTTGAATAGCTTTAATTTCTGGCTCTGTGTT 

TCAAGAATGCAGTTTATTTTGATTGGCATTTTCTACCAGTCCTGAAGATG 

AACCT^TAATTGCTGTCAGTATTTAAGATTTAGCAGGACTTGTCCTTTTA 

AGAACCAGGAGTCAAGCCCTATAACTCAATGTCACAAGGACTTTAAAAGC 

ACATACATAAAGATATATGGATGTAATAATCATAATTTTTAAAAAATTGT 

ATTAATCTCAGTGTTTTCTAAGCAAACCAAAACTTAATAATAATGGCATA 

GAAATTATTTCAATAAAACATAAAATCTGTTAAGCCAGTTACCAAAAGGC 

AAAAGAAAAGACCTTCTGCAATGCACAGAATATTATGTTGGAAGAAAACA 

TTTCCTTTAGACCTTTAAGAAAACATTGTTAGCATCAGGACACAACAAAC 

AGAATCTGAGGGTAAAAAACGTATATGAGCTGAAGGGAGTTGAAGGAGGG 

CATTACT AT TT C C CAC C CTTTTAAAGGGG AG AGAAAAC CTAAAACAGCAA 

GATGCAATAAAAGCTGAACTTTGGGTTAAAAAAAAATTCTTAAGTCTCTT 

ATAATTTATTAAGAGTGAATCAACCCCGTAAGAAAATTTCATTGTTCTAA 

CCAATTTTTTAATATATAAGTAGTTTTTTAACATCAACCCAATCTCTAGA 

AAGACCATTATAATTTCCCTTTAATTATAGACAACTTTATCATATAAAAG 

TTTTTTTAAATAAATCCTCTTATTGTGACTTACACAGACTATTCATGACA 

TGCTTGGACTTTCTGGTTTGTCGTGAACATCCTTTTCTTTCTTTCTTTCT 

^TTTTAAATTTTACTTTACGTTCTGGGATACATGTGAAGAACATGGAGGT 

-"TATTACGTAGGTGTACATGTGCCATGGTGGTTTGCTGCACCCATTAACC 

CGTCATCTATATTAGGTATTTTTCCTAATGTTATCCCTCCCCTTGCCCCC 

CACCTCCTGACAGGCCCTGGTGTGGGACATCCCCTCCCTGTGTCCATGTG 

TTCTCAATGTTCACTCCCACTTATGATTGAGAACTGCAGTGTTTGGTTTT 

CTGTTC 

>Contig23 

GCTAAATATAAGCTATGATAAAACAGTTGGCCCTCTGTATCATGGGTTTC 

ACAACTGTGGATTCAACTAACTGTGGATGAAAAATACTTGGGAAAAAAAG 

AATGGCTGCATCTGTACTGCACAAGTGCGTGCTTTTATTCTCGTCATTAT 

TCCCTAAGCAATACAATATAACAACTATTTATATAGCATTTACGCTGTAT 

TAGGTATTATAAGTAATCTAGAGATGATTTGAAGTATACAGGAGGATGTG 

CTTAGGTTACATGCAAATATTATGCCACTTTATATAAGGCCCTTGAGCCT 

CCTCAGATTTTGGTATCCATGGCAGTCCTGGAGTCAATTCTCCTGCAACA 

TCTCCATTTGTTCAGATTCTCTTCTATATCATGTTTATATCAGAAAATCT 

ACATAAGATTTTTTAATGTGTTCATATAGGTTTTGTGTATTTTTGGTTGT 

TAATCCCTAGATATATGCAGTATTTATTGCTATTATGAGTAGTGTTTCTT 

TACCATGTATTCTAGTTGGTTATTGCTGACAGAGAAATGTTGCTGGTGTT 

TCTAAGTTACCTTGTTTCTAACAACCTTGCTGAACTCTTATTAGTTCTCA 

TAGT' T " T *TTAATTAATCTTTCTTAGTTCTGATAACATAATCTGCAAATAAT 

GACAATTTTATATCTTTCTTTCCAATGCTTATATCTCTCAGTCCTCTTTA 

TCCCAAAGTATTTTCCAGGATCTCCACTATAACATTAAATAGTAATAAGA 

ATTTCTGTCTTGTTACTGATCTTAAGGAGAATAAATTTAAATTTCCTCTG 
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-CAGGTTTTATGCTTGATATAGATTTGTGATATATAGCCTTTCAG^GGt-r 

AAAAAAAAATGCTTTCCTAGTAGTCCTAATTTTTTAAAAAAATCATCATA 

AATAGATGTTGAACATTATCAAATGCTTTTTCTGCATCTATAGAGATAAT 

CATATGGTTTTTTACTATTTATTAATGTAATGAATTAGACCAATTTTCTA 

ATGCCAACTCTTTCTTGTATTTGTAGGGTAAATCCTATGGGATCATAAAA 

TACTTTTAATACATTGTTAGATTTGAAGAGTTAACGCCTTATTTAGAACG 

TTTTCAGTCACATCCATAAGTGAAATGGCACTATAGTGTCTATTACTATT 

ATATTTTTCTGGTTCTGAAACCAAAATTATACTCACCTCATACAGTAAGT 

TGGGCAACTTTTGTTCTTTTTTTCTGAAACAATTTGTGTATAGAAGAAAT 

TAACTGTTCCTTGAAAGTTTGATAATAATCATCCAGAAAATTATCCCCAT 

CTAGGGCTTTTACAAAAAGGAGACTCTAGAATGCCATTTCGGTTTCCTTG 

ATGTGTATTGGCCTCTTTCATTTAGGCTTTTGGATTTTTTAGGGCATTTT 

*TT C ACT ATAGG CTTTTTACCGG 

>Contig24 

CATAAACTTCAGGTTGGATGTTCGGTCAAAGTGGTCCGGCGATGCGAAAA 

CG AG AGGG CTCGAGGACTGGGCAGAGAACT ATTTGAAGGT AXCTCTCAGG 

GGAAACCAAGCGGAAGGCGGGGAGTAAAATTGGGAGGGAGCGACGGCCTT 

CAAAGAAGGGGCTTGCATTAGATCGGCGAGATCCGGGAGGGTCTGGTGGG 

GAGAAATGACTAGAGGACAAATCTAATGGAGAGACAGACGGAGATAGATA 

TCGTGACAGAGAGAGGGACAGTGACAGCGCACAACAGTGCAGGGTCCATG 

AGTACAAGGCCCTTAAGTGTACACCCCAGCCGGAGTCATGGCAATTCGAT 

TCCTGTACTGACCACCCAGGATTTGGGTAGACTGTACGAGTTAATGAGCA 

^GGTCCCCAACAAGACTGCTTCGACCTCAGATGCAAAGCACACTTCAGGG 

GTCCCCAAGCCACTCATGTTTTTTGAATGACTGCCATAAGTTCAAAAATT 

CCCACAATTCTCTCAGATTCAATAACTGGGTATAACCACTCATAGAACTC 

AAGAAAATGCTATCATTATTATTACAATTTTATTATAAAGGATACAAATC 

AG AAGGACT AG C CAAATGAGGAG ACACATAG AGAGAGGACT AGT AAAAAA 

CAGAGCTTCTGCGTCCTACCTTCAAGGAATCAGGATGCAC CAC C CTC CCA 

GCACATCAAGTGCTCATCAACCAGGAAGTTCCTCTGAGCTCCAATGTCCA 

GAGATTTTAGGGAGGATTCATTACATAGGTATCATTGATTAAATCATTGG 

CCATGTACTTGAACTCAATCTCCAGTGTCCCTCTTCTCCCTAGAGGTCTG 

AAGGGTTGGCTAATATCATGTGGCT CAAAGC C C CAACTCTAATT ACCTTT 

TTGGTCTTTTCAGGGACTAGACCCCATCCTGAAGCTATCTACAGGCCCTG 

C CATGAGTT AGCTCATTAACATAACAAAGACACTTAT ATTACTCAGAAAA 

TTCCAACAGTTTTAGAAGCTCCATGTCAGGAACCTGGGACATAGATCAAA 

TTCTTTTTTTTTTTTTTTTTTTGGAGACAGGGTCTTGCTGTGTTGCCCAG 

GCTAGAGTGCAACGACAGATCACAGCTCAATGCAGCTTCAACTTCCCAGG 

CTTAAGTGACCTTTCCACCTTAACCTTCCAAGTATCTGGGACCACAGAAA 

ATGGCTAATTATCCTGGCTGATTTTTAAACTTTTTTTTTTTGTAGGGATG 

GGATCGCCCTGTGTTGCCAAGGTTGGTCTCAAACTCCTGGGTTCAAGCAA 

TCATTCTGCCCTGGCCTCTGTGATGGTTAATACTGAGTGTCAACTTGATT 

GGATTGAAGGATACAAAATAATATTTTTGGGTGTGTCTGTGAAGGTTTCG 

CCAAAAGACATTACTTTGAGTCAGTGGACGGGGAAATCCCCCCTTCCCCA 

TGGGACGGGGAGACCCCCCTCCATCCAGGTAAAAAAATCTAATCACCTGC 

AATGTGGCAGAAATAAAGGAGGGAAAAAACGGGGACCCCTANATGGGTTA 

TTCTCCACCTAATTCTTCCCCCAGG 

>Contig25 _ 
CCATGTATTTCATTTCTACAGACCCTGAGATGAATTTGTCATTGCCACGG 

GGTCCTGAAGTTCAAATACTCTATTTGGTATCCTGCCCCTGTGGTTAACT 

GTGATCATTTCACTCACCTTGTTTATGATGAGAGGTGCCACCATCTGGCC 

TCCTCCACTCTGCAATCCTGTTAATTCCTATCAAAGCTGAAAACCTGCTG 

CAGCACCCACACCATCACCTCCAGCCTAGAGAGGGAAGCTACCAGTGAGC 

TCTCCTGGATGCCGGTGTGCCCCTCGCCAATACATTTCTTCTTAGTCCCT 

TGGTCATCCTGAGGTGTGTGATTAATGGACAGCTATGTGGATTGCACATA 

ATAGATGTACTCCAGCATCTTCATCCCTGATTTTCCTTTACAGAAATCAC 

T CAACCTT AGCAACATGTGAAAATCAC CTAAGGACATTCTTTAAAT C CCT 

CTGT C CACATGGCAACACAAAC CACTTAAAT AAG AAT CTCCAGGGAGTCA 

CTCAAGCA.TCAATGTTTTTTAAAGCTCCAATTTTAAGGATCATTACATTA 

TGTCGAAGAAATTATAGTATTTCAGCCTTACTGACTGTAAACCACCACCA 

TATCTAAGCATCCATTAGTCAACCTAGCAGACAATAAACTAACATTACCT 
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CCAGGTACTCW^TCAATTCATTGC 

CTTATTGACAAATTCAGCCCAATCTTGGTTGAACACATTTAGAATATATT 

TCCATGAACAATATCCGGTTGACGAGTTTCTTTAACTTTTTGGAGTTTAA' 

GCCATTTCCTTTCACAGTAGCCTTGTTAATTCCCTGTCAATGC7CCATGG 

GGGTCATGAAGAGACCTCTTATTAACTGTGAAGCAACTTGGCTCAGGTGC 

AGACACTCAAATGCTTCACATGCAGTGGGAAAAGAGAGTGATTGTCTAC 

>Gont ig2 6 

TTTAAAAAGAACTGAGTCTTTATTCAGTCGATTCTTCTAATCTATGAACA 

TAGCATCTCTCTCAAAGCATTTAGTCCTTCTTTAATTTCTGTCATTAATT 

TTTTAAAATTTTCATCCTAAAGATTCTGTATATGTTTTGTTGAATTTATG 

CTTAAGCATTTCACTTTCTTGGTAACAATTATAAATGATTTTGTGTTTTT 

TATTCCACTAGTTCATTTTCAGTGTGTAGAAAAGCAATGAATTTTTGTGT 

GTTGATCTTTGTTCCAACATCTTGCAACATTATTGAACTCATTTATTAGT 

TCTAGGAGGTTTTTTCATTTTTCTTGTAGATACCTTGAGATTTTCTATAT 

AGACAGTCATGTTGTCTGCAAACAGGCACAGTTTTATTTCTTCCTTTTCA 

ATCTATATGCCTTTTTTTTTTTTTTTGCCTTATTGCAGTGGGTAGAACTT 

CTAGCACTATGTCAAATAGCATTGGTGAAAGCAGACATCCTTGTTCCTTG 

TCTTAGAGGAACATTTGGTCTTTAATCTTGATTTAAAAAATTCCTTGCAC 

TAAGTTACCGTGTTTTGCGGGAGGGAGAGGTGGGGTGAGGTGGGGATTTC 

CCCTAATGTTTACAAGCTGGGATTTTCTTTTTCCTGTGTCTAATTATTTT 

CCTCATTGGCTTGAAAAATCTGATAAAACATTTTAGGACTGTGTATAAAA 

TAGAATTAGCCAAGTGCAATGTCTTTATTCAGAAGAAATTTCATGGACGT 

TGTGCCTACTCTCTTGGCTTCCTGGCTTCATGGCTTTCCAGATCCCACAG 

TAAGCTCTGGATAGTAGAAGTTATAGTAAGACTGACTTCTAAATAAATGA 

AGTGACTTTAACCTTACTGATATGGCTTAAAGAAAAGGAGTGGCCTTTAA 

GATCCATGAACTTCTCAAACAAAAGTGATAACGTTATCTCCATGCATATA 

TAATACTAAATATAATGCAACTGAGAGAAGTAGGCTGTGGTAAGAAAGGA 

GACCCAAGTGCCATCTGAAGGCAGCACTTACCACTCTGCTTCATCCCACC 

GAGGAAACAAAGCATGAGTATTGCCAGATTTTCTTCTGTTTCAAGAAAAG 

CCAGAAATCCAGGTTTTTGCGTGAAATGTCCTGATTTTAATGTTGGGAAC 

TAATTTATATTTTGAAATAACATTGTGTGGGACAAGTGAACTTGTATGTG 

GAACTGCTTTCTCCCAGTGGCGACCAGTTTGGACCGTTGATACTCAGCAA 

GTTCAGCCAAGTGCGCCTTGTCATTGTCAGTCATCAAGGTGATGTGTGAT 

TGGTCAAGCAATTAATTTTGCTCAGCATCTCGTGTGTTTTCAAAAGAACT 

GAAGGTTCATTTGC 

>Contig27 

TTTCAGAGCACAATGCGTATTCATAGTATATTGACTTAATTTCTAAGTGT 

AAGTGAATTAATCATCTGAATTTTTTATTTTCAGATAGGCTTAACAAATA 

GAACATTCTGTATATAAATGTGTAAATTAGAGTTAATCTTTCCAATCACA 

TAATTCGTTTTATGTGAAAAAGGAATGAACTGTTCCATGCTGGTGGAAAG 

ATAGAGATTATTTTTAGAGGTTTGTCGTTGTGTTTTGGGATTCTGTTTTC 

TTTTAAAATTGTAAATATGTACTTGTGTGAATGATTTTTTAAAATGATTT 

TACCATTTTTGGAAGGGTATTTAATGATAGAATATCATCGAGCCAACATG 

CACTGACATAGAAAGATGTCAAAGATATATTAAGTGTAAAATGCAAGAGG 

GAAAACACTATGTACAGTCTGAGCCAAATCAAAGCATGTATGTTTTTTAT 

ATGTGTACAACAAAAGGTTTGGAAAGATATGCGCCGAATTGTTAAATGTG 

GTTTCACTTGAGGGGGTGGGAGGATGGGGCCCCAGAGGGGTTTTTATGGG 

GGCCTTTCACTTGGTATTTTTTTCATTTTGTTCTGTTTGAAATTTTGTTT 

TTTCTTTTTAAATGGAGTTTCACTCTTGTCGCCTAGGCTGCAATGTAGTG 

GCGTGAACTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGTGATTCT 

CCTGCCTCAGCCTCCCATGCCTCCTGTGTAGCTGGGATTACAGGCACCCA 

TCACCATGCCTGGCTAATTTTTGTATTTTCAGTAGAGATGGGGTTTCACC 

ATGTTGGCCAGGCTGGTCTGTAATTCCTGACCTCAAGTGATCCACCCACC 

TTGGCCTCCCAAAGTGCTGGGATTTCAGGTGTGAGCCACCACGCCCAGCC 

CTGTTTAAATTTTTTATAAGTATGTACTACTTTTGTAATCAGAATTATTA 

GAAAGCATTTTACTGATTTAAAAGCTTAGACATGTTCAAATGCCTGCAAA 

ACTACTTAACACTCAGCTTTAGTTTTTCTAATCCAAAAAGGCCGGGCAGT 

TAATCTTTTTGGTGCCAATGTGAAATTTAAACGGTTTTATGTTTTTCCTG 

TGTTGTGAATGAAAAATATTTCTGAGTGGTGGTTTTTTGACAGGTAGACC 

ATGTCTTGTCTTGTTTCAAAATAAGTATTTCTGATTTTGTAAAATGAAAT 
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ATACAATATGTCACAGATCTTCCAATTAAGTAGTAAGGGTTTATCCTTAA 

TCCTTGCTAATTTAAGCTTGCATAAGTCACTTTACTAAAAGATCTTTGTT 

AAGCTAGTATTTTAAACATCTGTCAGCTTATGTAGGTAAAAGTAGAAGCA 

TGTTTGTACACTGTTGTAGTTATAGTGACAGCTTTCCATGTTGAGGTTCT 

CATATCACCTTGTATCTTGAAGTTTCATGTGAGTTTTTACCATTAGGATG 

ATTAAGATGTATATAGGACAAAATATTAAGTCTTTCCTTTACCTAAGTTT 

GGCTTCTTGACTAGTAATAGTAGTAGATATTTCTGTAATAAATGTTCTCT 

CAAGAT C CTT AAAATCT CTTGGAAATTATAAAATTATTGGAAAGACAAGA 

ACAGTTTTTATTCATTATATGCATTATTATCG 

>Contig28 

ctttctcaagaaaagggaactggagcaattaaacatatgtaatttttttt 

taaaaaaccctaaacctaaacatctacctatatacaaaaattaattaaca 

atggatcatggactccaatgtaaaacatgaaactctaaacttctagaaaa 

aaaactggagaaaacctttggtacctatgacaaggcacagtttttagact 

taacactagaagtgtgaactatacaagaaaaaattaataatttgaacctt 

atgaaaatcaaattatttgctctccaaaagaccctgttaagaggatgaaa 

actaaattacagattgagagaaaatatttgtaaatcacatatttgacaat 

ggacttgtatctaaaatatctaaagaactctcaaaactcaacattaaaaa 

aaatatctaattagaaaatgagtgaacattttacgaaaggggccttatag 

attagcaaataaaacacttgaaaagatactcagcatcactagccattaga 

aaaatgcatattaaaaccacaataatgtatcgctacacacatataAgaat 

ggtttatgaaaaaatagtgatgacaccaactgttagtgaagatgtggaga 

aacactcatacattgctggtagaaatgtaaaatggcatagccactgtgga 

aaattatttggcagttccttttaaaactaaaaatcaatctaccacacaac 

ccagcaatttcattacagggcatatatcccagagaaatgaagatttatga 

tcacacaaaaatctgtacacaaatgttttatggtcactttattcataata 

gccaaaacctggaaactatccaaatgtccttcaatgggcaaaggattaaa 

cacactgtgatacatccataccatggaatactactcagcaataataagga 

aagaattactgctacacacaagttggattaaactcaaggaaattgtgctg 

agtgaaaaattaacaagccaatctcaaaggacacatagttcatgattcca 

tttgtataacattaattaacacaattaattacagagatggagaacagaat 

agtggttgccagggattatacatggtggacgcggtgaggcgggcctccac 

gccttggagatgaagggggctacaccctttaaagcacacccacgagagag 

ttttgtgcggaggggcccaatttaagtactccgccccggggggggaacac 

aggggcaaacaaaaaaaattggccttgggggtgaccaaacacacaaaaaa 

aaaacaaacacacaaaaaaacaacnatgggtgggaggattaatcgccaaa 

tctgagtaagctatctggacagtaccaatatcgatttcccagttttgatg 

ttgtactataataatgcaagatgttaacattggaagaagctggctgaagg 

gggctcaggaactctctggacatttctttgtaccttcctgtgaatccatc 

attattacaaaataggacattttctaaaggttaaatcattttaattttaa 

aatgtccctgttactgttgaaactcacatctccatatactgatcaagaac 

agcactaatggcccctggcctccaggaattcacaattcctactgactttt 

ctttgaaaccttggccaagtcgcttctcttctctggtcctcaatttttca 

tcttcaaaatgaagattgaatgactattaaaatctcttgcaattcttgag 

atgaagggtcctaaaggaactgaagaggatgccatgtaatgtaaatatgg 

gtttttactccatcagccagccaagacagagggcagacaccaagacatgg 

taaccaaggaggccatgtgtaaacaaagaccatttagacttatgctctgg 

cctttgcagcccaactggtgtggccagttggtggggtatgaagaaaatgg 

ggccttccaggaaccatgttgagtggagataagcagggaggaatgcagaa 

gacatgggggcagtgccagtctcagcccgagccagctacacccacacatg 

gttatgaaagactgacagcctgtaagntgaacacagccctgcctctctta 

GATAGGC 
>Concig29 

GCAAATATGATCTCAGATGTGGATTTACTGTAAAGTTCATCAAATTTAAA 
TTTCAGAACACTTAATCTGCAAGAGTCCTTTCCAAGACCCTATACCTAAT 
TTTGTGTTTACAATTTTATATTTGTTTTCTTAAAGAAGACCACCAATATA 
AACTATATCCAGCCTTCATGATAAGTACATAGGAAACTATGCAAATAAGG 
GGGAAAAAAAACAAAGAAAAATACCTAGTTTACTAATGGTTCACTTCTGA 
AT AG C ACAT ATTCATAATGATACAAGCACT CATT ACTAGT CTAGG AAAAT 
GAAGATATAATTGCATTAGGAAGATCAAGAGGTAGGAAATGTGGATGTGT 
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GTGGTATAGACTAGGGCAGGACAAAGAACCTAAATCCTCATTTTCTAAAG 

ATAATTGTTAATACGTAAAACTCAAAATTCAAGAAGTAACAGTAAAAGCG 

GTCATTAAGAAACAAGCACTAAACACCAGATAGGAAGCGAGAGATGGGGG 

AAGAGGGCGACAATCTGATTATTTTTTGCAACAAATTTTGTAAAACCATT 

TGACTGTTTACATGTAGAACTTGGATCTTTTTTAAAAAACACAAAATAAT 

AATACTATTATTTTTTAACTGGATTTTTGAAAAAGAAGATAAAAGTCTCA 

TTZTAGTAATTAAAACTCATTCCAGGTTAGTCCACTCAAAACTTATATTC 

GAAAATTAAAACTTTGGGAGGCTGAGGCAGGCAGATCACCTGAGGTTGGG 

AGTTCGAGACCAGCCTGACCAACACGGAGAAACCCCGTCTCTACTAAAAA 

TACAAAATTAGCTGGGCGTTGTGCATGCCTGTAATCCCAGCTACTCGGGA 

GGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAG 

CCGAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCA 

T CTC AAAAAAAAAAAAAAAAAAAAATT AAAAC C T CTGGAAGTT GAGTTTG 

CAAATATTCATTATGCTCATTTTTAACTTGTATGTTTGGAAAATGTCATG 

ATGAAAATTGAGGTTGGGGGATGAGAAAAAAAGAAAAACATCAACCCCAC 

AGCCCATTCAATTTTCAGCCCGACCCACAGCTCCGGGGAAGGGCAGCAGG 

TCCATCCTTCACTCTTTCTTCACCTCTTTCCCCTCCTTCTGGCTCTTCCA 

CCTCTAATTTGGAGCCCAAAAAAAGGCACTGGGAAATGGAAAAGTCTTTT 

GTACGTGGTACTTGCCGGGGAAGCTGCCATGAAAACCTGGCCCCACGGTG 

GGGAGGGAATGCCCANCTGAGGCCTCGTGCCCATGCTAGGATAGACTCGT 

CCTU^CATGTCAGGTGGTCTGACAGGGCAAGCANCANGAAATCATGTATG 

AGTATGAACTGATCTGTATGCAAGGGCGGGGAGAACACGCGGAGGAATGG 

GGCGTGAGAAAACAGCACAGTACGTTTCTTTAGCAGCTGTCTCTGCTCAG 

CCATGGGAGGTCACAGAGAAAGAGGCTTGGAGGCGTTATTTTCACTGTGA 

GATGTGAGTGTAAAAAAGTGCCCAAGACACAGTGAGTACCAGGGAGATGC 

CCTCTTTCCTACCCGAATGCAGAATGGCCACAGGCCTTAAAACACACACA 

TGGGTCCTCAGAGGAGAGAGGCCTCCACAGTGGACACCCGCATTCTCCCC 

TGGTCAGCAGCAGCAGGGCGAGTGCTGGGCCATCATGAAGCTTCACAGGC 

AATGAGCTCTCAGCAATAACAGGAACAGTGCCTGGGGGACTGTAGCTGCA 

AGACCGATTTTCATGTAAGATGGCCTCTGAGGACTCCGAGATACACCAGG 

CTGAGACTAGCTGGCAGCTCCAAGTTGTTGGTCAGAAGAGAACAGGAACT 

AGGGAAATTGGAATTACTGTTACTACAATTCCTTTACATCCGCACAACCA 

ATGTCGCCCAGCATAGAGTGCATTGATGTGATCATGGTTCAGTACAGTAT 

TCACGTCCCAGGCTCAAGTGACCCTCCTGCCTCAGCCTCTCAAGTGGCTG 

GGACAGCAGTTGCATGCTACCAGGCCAGGCTTTTTTTTTTTTTTTTTTTA 

GTTTCTGTAGAGCACATAGC 

>Concig30 

GGTTAACAATGGCACAGGGAAACAAACAGTTCCAGGTGCAGGGGCTCTAA 

ATCTATCATAAGATGTTAGGTATGGGGGCTCTGCCGGACACAAACTCAAG 

GCTTTATGCTGTTATCTCTTGAGCGAAATCCTGGGAACTTCGTACATTGC 

TTGCTTCAGTACCTTATCAGTTAATCGGACTCTTTGATATGTTGGGAGTC 

AGCGTACACAAGTTAACTCCTTGAGGAAGGGGGTGGGTAAGGAGTCCTTG 

ATGTCTGGTAAATGAAGGAGCGAAATCGAGTTCCTCTGGCTTTCTCAGCT 

AAGGGAGAGCTTATTCATGTGGAAACAAGGCTAAGTGATTAAGGGAGAAA 

GGGAGAGTCTGAAAACAAGGTTAGGTATTACAATGTCAATAAAATTGGTC 

TCCTTATACAGTCCTATGGTAGATTTCTTTCCATCTTTAATCTCCCTCTA 

GCACCACCAGACTTTTTCTCTCTGTACCTTGAGATGTAAATTTTGCTATC 

TGAATTTTCGTCTAAGAGTTGTTTCCTTTAATATGCAAATTTAGGGTTAT 

TTAGCTGACAACTGCCAAAGTAGTGAAACAAGTTATCAAGAACTTGAACG 

TCTAAGGTAGGAAAAAAAAAAGTCTTTATGAATCTATAAGATGTACTTCT 

ATTGGCATGCCTAATACGTCTATGTATTTACGTGTTGTGTACACAGTTTT 

TCACTACTGAAAATATATAGAGGAGTTCTAATTAATTGACTTAAGACAAT 

AAAAGCGCTTGAATCAAATACCTTATCAGGAAAAAGGAAAAGACAAGTCA 

AATGCTTGTTCZAAGTTTATATAACTTAAGTAAAATCTTTAATAAATAAGC 

TAGCTTTAACATTATTTGAAATGTCTTAAGAATTGCCAGCAGGTTCTGGG 

TTACAGAACTAGTGGGGGTGCAGTGGGGTGAGGGTTGGTGGGGTGGGGGG 

TGGTACGGGGGCTTTGTTITTTCTTGCTGCCCCCTTCTGGGTTGGGGAAG 

TGGCAGGACCTTGGCAGCACCCCGAGCCGGCATGGCGTTAATAATGGAGG 

GATGCCAGACCCAAGTGGCTAAGGCCCGGCTGCAGAGCCAAGTTGGCATT 
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TCCAGACTGGGGCTCGGGCCGCACCCTCTCCAGGACCCTCCCCTTGTACC' 
GAGCAGATTGTCGCGGGCAGTTTGGGCCAGCTGTCCTGGCGTGGAATTTC 
CCAAATTCAACAAATCCTCCAAGAAATCAATCCATCCATTCATCCATCCA 
TCCATCCATCCATCCATGCATCCATCCATCCGTGGCAGATTATGAAGCAT 
GGATCATTACTTTTGGGATGTGGATATATTCAGTTAACAAGGAGCAGCTT 
TCAAGAGCTGGATTTTATGCTTTGGGTGAAGTTTAGAAACACTAGCTCCC 
AC* 

>Concig31 

ACCTCATGTGCTCTAGCGCCTCTTACCTCATGCCCTCCACTCTCAGTCTT 

GCACTCACCCTGCCACACTCAAGGGCTTCCCCAGGTTCCTTCTTAGATTC 

CACCGATAGCTCAGGGACTTTGCACATGCTACGGTCTCTGCCTGGCTCCT 

CCCCAGATCTTCTCATGCCTAGCTGCTTCTCATCAGCACCCCTCAGAGAC 

TGTCCCTGCCCCACCTCTCCAGGTTCCATACCTGCCACCCTCCCCCAATC 

ACGTAACAGTTTCTTCACAGAGCGAGTTACCATCCCAGTATTTCCCTAAC 

TTATTTTT7GTGACTGGTCTGTTGCCTGTCTCCACCACAAGAACATAAGC 

TGCATGTGAACAGGAGCCTTGTCTATCTTGTCACCCCAGTGGCTGTGACA 

TAACCTGATACACATTAGATGCTCAATGATGTTTGATGAATGAAGTGCTG 

GTAGTCCAACTGTGTTTCCTTGTCTGTGTAAGTATGTCTGTTGTGGTTTC 

CTAAGAACCTACAGCTCTCCCACTGTGACTCCTGTTCTATGGTCCTGATT 

TGCTGGACTAGAATCCTAACCTACATGCTTACTCTTAGTGTCCTCCCCCA 

GAGGCTGAATCCCAGTCCCTAAACCTCCACCAAATGGCTAAGACCtAGCT 

TCCAACCAGACAGGCCTACGCTGAGACCTCAGCACCGCCCTTCTGCGGTC 

TCATCCTTAACGCATCCTTCAGGGCCCAGCTTAAATGTCTCTTCTCCAAG 

GAAGGCTATCCTCTTTCTGCCCCTCAGTGCTCTCCATGCCTCCTCTATGC 

CTCCATGCCTGCTTTCCAACCCTGCAGAGGTGGAGAAGTTGCTAATCTGC 

TGTGTTGACATGTGCTGGGGTGCCTTGGGCCAGGGAGCAGGCTGGTGGTG 

TGCTGATAGCCCGTGGCTGTGCCCAGGTCCATGCTCACTTCCTGAGCCCC 

AGTGGAGTAGGCTCCCTTTCCCTTATTGCAGCACTCAGAGGAAGGACGTG 

CTTCTTAGGACAGATCTGGCCAACCTCTCCCTCGTGAGAGAAGGCCCAGC 

CATCCTCTTGCCCTCTTTCTTTCTCCTGCCCCCGAGTAATAAAGGTGCCT 

GGTCAGAGCCTTCTAGAAGGAGACCCAAACATCCACCACACATTCCCAGT 

TCCAACCGTCATCCACATGGCTGGCTGTGCAGGTAAACGCAGAGTCTGTT 

TCACACACCCAACCATCTAGTATTGGATGGGAGGACAGTAGCGTGACACT 

CTTCTCCAGCCTTGAGCCCTACTGTGGGCCCCACCCAACCCAGATACCAG 

AGGAGCCCTGTACTGGGATGCTATTGGATGCTTGTCCAGTCATGTACAAA 

GTTAGCCCTTTGTTATATAGAGTTAGCTACGTACATCTTCCTCTGTAGGG 

AACCCAAGAGGGGAGAAGAGATATGTAGTAGGATTTAACCTGCAAATCCT 

CTGCTGAGCACCGTGCACTACATACAGTGGGTAGCATGTGGTAGGTGCTC 

AATAACTATTGACCGATCTATTGAATACACGTAAGATCGTGACACTATCT 

AAAACGNGGGGTGTGGGGGAAAAACCCCCCCCTTGTTTAGGAAACCCAAA 

TTGGACCGTGTTGGC 

>Concig32 

GCGCGATTGTGCTAAAGATCATGCATGCCTGATCAAACGTCCCCATATGG 

CGTCTCAGAGTCAACTCCTTCCCCATCAGTGCCCTGACTTCGGCATAACA 

AACCTGGCAGGTTAAGTGATTAATCGGTCCTGTACAACTGTAGCCCTTAG 

CAGGAAGCACTAAGCTTCGTTTTCATTTATTTCTTCCCTGGAACTGCAAG 

AAATGAGGGATGCCTTCCGCCATGAAGTTTTGCTGATTGTCCACTTTGTT 

CTCAAGGAGATATTCACAGTTTTTAATTTGTCTTTCTCTCCTGCATGGTC 

TCCAAACCTGTCCAAAGAAGCCAGCTGGCTCCATCATCTGTAAAATCACC 

ATTGTCACCAGAGCACTTGACTTCCTGTTGCCCTACAATCCACCTGCACT 

TTATTTCCTGCCACCATGATAATGTAGTGTTACTACATTTTACATTCAGC 

TGTAAGAAATGTTACATTCATTTACTTAAATCAAATTAAGTCTGCTCACT 

CAGTCCCCCACAGTGACCAACTTATAAAAGAGAAGGTACATTTCAGTCAT 

CACTGAGGTTCTCTCTACCACTGGAAAACTGAGGAAGGGTCTGGAGTCCA 

CAGTGGTTAACATCATTGCCTCTGTTTTTTCTCCTACTCAATGTAACCAT 

CCAAGGTTACTCACAAATTCACAAAAAGAGGTCTTCACCTCTGCTCTCAA 

GACCCAGAGGGCTGGGTTCTAAACTCAAAGGCCAATGTTCCCCAACTTTT 

TGCATTGTTTCAACATTGGGGAAAACTCGAGGGGATTCAAGAATGGTTAT 

ATAAGTTTTGTGGAAAAATGTATAATTTTTTAAAATTAAAATACAAAGTA 

TTATGGAAAGCACTAAATATTGAATTTATATAAATATTCCAAATATTTTT 



FIG. 4 (11 of 61) 




BNSDOCID: <WO S906<426A1J_> 



WO 99/06426 



PCT/US98/16102 



ZTAAATTTTTAGTGAGAAACTTGAGCTTGCTTCTGTGAGATATTTATTTT 

AAAACAGATTTGACACTTAAAATGTCTAATCAAGCCTTTTAAACCATGAT 

CTATCTCTTCAAATTCTTCAGATGCCACCATCAATAAAGAAACTTTGTTC 

ACACAAGTAAGTGGTAGCAAATGGCAGGGTG7TTATCATTTTTTTTTTTT 

CTTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGG 

CGCGATCTCAGCTCACTGCAAGTTCCACCTGCTGGGTTCACGCCCTTCTC 

CTSCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCACCACGCC 

CGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCC 

AGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCA 

AAGTGCTGGGATGACAGGCGTGAGCCACCGCGCCCCCGCCGTGTTTATCA 

TTTTTTGCCTGATGAAATTTTCCTTGCCACTACTCTGGATGGTTTGATAC 

ATTT AAATTGTGCTTC CAGGGTACAATTATC CTTTAAATCTATAC CT CTT 

TCCTTTCTTTTATTGACAAATATAATGTTACACTTTTCTGTCATTGCAGC 

CACACCACCAGTACACAGATCCCAACAGAGTTGTAATATTTTATTAGTTT 

CAGAGTTTCAATATTTTATCACTTTCAATACTTCATGTGCAGGAGTTTTA 

TTTGGTACTTCTTTACAAAATAAATGATGTGCTTCCAAGCATTTCTTTTC 

AATAATTCCAATCAATGTTATTAACTGAGTAATACTAGTATCTGTTTATT 

CATAAATTCACAGGAAATGCTTTTTTACTTATTAGTCTTTGGAATTCTGT 

TGTTTGTATAAACATCTTTCATGATGGCTTTGTGTCTACCAATAGCACTA 

TTGCCAAAAGGCACCTTTTTCTTGTTCCTTTACTTCACTGGTCCGAAGCC 

TGGTACCAACAACTACCACACAGACTGGGAAATGAGCAATTTTGCCACGT 

GCCCTTAGCTATTAATGGTGGCACTCCATAACTAGCATCTTAAGCTCAAT 

TTCATGAAAGAAATGTGTTTCTTATTTTGTACTTGCAGGCACTTTTTAAA 

CTTGT AAT CTTTT ATT CATACTTT AAAATTAAAACAG AGTAAT AG AAC C C 

ATAGAAGGAAATCAATACCCACGAGTCCATACTGATATAAATAAATAGTT 

ACATAAATAAATGGGGGGAGAAATAACAGCTCTTCCTTACAGAAAAATTT 

CAATTAATAAATGAAGAAGGAATTAGGGAAATACAACGTTAC CATTAAGC 

AACCACAGTAATAATCATTACAGGCAATATCCAAAAATAAATTCCAAAGC 

CAGTGGGCAAAAGTTTGAGGAGATACAGGATATTAACATAGTCTCCAAAT 

AGCTCATGCTATTTATAAATTACAAAAGGAAACATAACAACTGTATAGTG 

AAGAAACTCAGCAGACACCACCTTAGCCAAGTGATCAAGGTTAACGTCAC 

TAGTAATAGGGCTTGTTGACATACTGGACTCCAATCTGATACACTGATAA 

GGACACATGACTTCTGCAGTATTCTTACCAAAAACAGAATTCTAATGTAA 

TTAAGGAAAATGTCAGACAAACCTATTCTGAGAAACATTCTATAAAACAA 

CTAACCAATACTTTCAAAATTGTCAAGGTCATAAAGACCAGGCGATGGTC 

ACAGATTTGAGGAGACTAAGGAGATACAACAACTAAATACACAAATGGAA 

CCATGGCATTCTTGATTGGATCTTGAAACAGAAAAAGGATATTAGGAAGA 

AAAGCTGATGAAATTCTAATACATTCTGTAGTTTAATTAATAGTATTGTA 

CCAATATTAATTTCCTAGATTTGATCATTATACTATGGTTAAGTTTTTAA 

CATTAGAGGAATCTGGGAGAATGGTATATATGAACTCCACTGTTCATTCA 

ACTTTTTCAGTAACTATTATTTCAAAATAAAGTT 

>Contig3 3 

GGGAGCGGCGGCCCACGCTGATCTCTAAAGCTTTAGACCACATTGGCTCG 

AGCATGGTCATGGCCGTTTCCTG 

>Contig34 

GACGTCTTAGCGCTATATTATAAAGAAATATTCACCTCCCTGCTGAGCTT 

ACAGGGTGTACCTAATGTCCAACAATATGAAATCTCTTCAATGAATTGCA 

GCACGTCCATATATAACCCACATGGAAGCTGTCCTCTTTCCTCACCTTCG 

AACTTCCCATGCCAAAGAGGGACCTCTTGGACTCAAATACATCTTAGCAA 

TATAGAAGATGCTGGAGACTTGTAGGAGAAGTGGAGAGGGTTTACAGTGT 

AGCCCCACAGAAAACAACTTATGACCCCATCAGTCACTTGTCCCTTTTTT 

CCATGCCTCAGTCTAGTCAGGAAACCACTAGATCCTGGATGGCTTCTTCT 

CCCTTCCCCTCCTTTCTCTTCTCCTCTCCCTCCCTTGCTCCTCCTTCCTC 

CATCACCCACTCCTTACTTCCAACCAAAACTTGACTAGCTCCAGTCTCAT 

CCCTCCTTATTGAAAACTATTTTACTCAGCCCTCCTCCCCCACTCCTGCC 

CAATCTTTATTCCTTACCTACATCAGACTTCACCAAAACAAAGGCCAGGA 

TAATAAACAGGACAAACTCTTTCAAACACATTTTAATGACCATATTTTGT 

TATTTTGGT ACAATTTG AGGAGT C C CAATC C C CAGGG AAGACTAACAAGA 

AGTTCTCCTAACAAAGGTGGGTCTCCCCTTACTAAAAACTCCTGTAATGG 

CTGAAAAGAGCATGAGGTTTTCTGCATATCATTACACATTCAATAGAACG 
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TCATGCAGCTGTTAAAA**x'GATCTGTAG«^GGCTATCTTGTGACAGAAKvj- 

GCATTGGAGATATACTGTTAGTGACAAAAATAGGTTATAAATGAATTTTT 

CCATGCATGCCTCTATATTTATAAATACACACACATAAAAGACAGGAAGG 

ACAGACATTAAACATTCATAGTGCTTAAGATGATGCATAGTATAATAGTT 

AGGACCATGGCCTTTGGGACAGAAAACTACAGCCTCTCTCCCACTTATCA 

GCCATGGGACCTTGGGCAATTTGCTCAGCCTCAAAGCCCCTGTTCCTTTA 

TCTGTGTGCTGGGGTTGTTGTAAGAGTTAAGTGCAATACACAGAGAGAGA 

GAGAGTACCTAACATGTATTATGTGCTCAGTCAATATGCATCATAGTACT 

CATTGTTACATATGTTCCTAAGTGCTTTATACGTTTTTTCCCTAAGTTGA 

CCATCTGTTTTTGGCATTATGAAACATAATGATCCTAACAAATTAAAATT 

AAAAACATAAAGAATATTTGCCCCAAAAAAATAAAGAACATGAATTCTTC 

AAGTAGCCAAGGGGCCATAGACAGAAGTAAGCCCTTGGTGGGGCTTAGTT 

GAGAGAAGTCTCCAGAAGGTCTTTCGTGTGTTAAAGAAGAGGGTAACAGG 

GAGGAGGTGGGGAGAGATGTTAACTGAGTCTAAATGAGCACCTGGAAGAA 

GAGATGGGACAGGCCACTTCTGCCTGGACTCCCTGATTGTTAAGAAGAAT 

GAAAAAGAGCAGAAGTCTTCCCTGAGCCCAACTTCACTCCCTGACTTAAC 

CTAGTCTTTGCCCCTTCCCTCTCACTCATGGCTACTTTCTGTGGTCACCT 

TGTTGTAGAAATGGATGTGCAGCCACCTCATCTTTTTCTACCTCCTTCAC 

ATGTT^^AGATAATTTAATGTAGTAGAAGACGGTTACAGCAAAAAATTAC 

AAAAATCAAAATATCTCTGCTATCTACTGTTGCATTTCTAACCATCCCAA 

AACAGTAGCTGAAAACAGCACTCGTGGTCGAGCGCGGTGACTCATGCCTT 

TAATTCCAGATACTCCGGAGGCTGAGGCAAGAGAATCACTTGAACCCGGA 

AGGTGGAGGTTGCAGTGACTCAAGATCATGCCACTGCACTCCAGCCTGGG 

TGACACAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAGCACTCGTG 

TATTTTGTTCAAGATCTGTGGTTTGGGCAGGGCAGGGCTCAATGAGGACA 

TCTCGTCTCCGTTCCCGCAGTGTCAGGAAGTGTAACTGAGACTGGAGGGT 

CACACAGAAGATGGCTCCCTCAAGTGGCCAGCAAATTGGTGCTTACAATT 

GACAGGGAGCTGTTGACCAAGGGCCCCAATTCCTCTTCCTATGGCCCCTT 

CTCGGGCTGCATGGGCTTCTTTACAGAATGGCAGCTGGATTCCAAGAGCA 

AGTATCACAACCTACAGAAGAGTGGAGGAATATTGAAAGTTCACAGTCTC 

TTAAGACGTTGGCCCAGAAACTGGCAAAAGCTTCATTTCTGCCATGTTCT 

ATTGATCAGTCACAGAACCTGCACCAATTCAAGAGGAGAACATATAGAGG 

ACATCTCTCAATGGGATAAGTGTCAACAAATTTGCATCTATCACAATCTG 

TCTTTTGGGTACAAACTATTTCTATTCCTCCATTATGCAAAATATACTCA 

CAACCTCCCAGGGGTCGCAAAAGCCTCATCCATTTATGGCAAATGTGGCC 

CTTTTAATTTATATAAAATAATTTGCGGGGGCTTCCTTTATATTTTTAAC 

TCCCCTGC 
>Conci325 

GTGCAGAGAAGTGATTTAAAGCCCTTCAGAAAGAATGCTTTATTCCCGTG 

GAATTTGGTAACTTGCTTGGGTGTGGGGAGGTTTGTCAGCTTTCTCCACT 

CAAATTATCAGACCCTTTCCATTTAGTGGTAGACCATTTCCCTCGTCCAG 

GCCAAGGGCACATAGTACAGAGAAATAGGGAGTTGTTACCCAGGGAGAGA 

ACTTGGCTCTAAACCTGTAATAGAAAGGTCAGTTCTGGTCTGGAGGGTCA 

ATTTTGATCTTTGGCTCAGATCCAGGAATTGGAACCAAGGCTTTTGAACA 

TTTTAAT G CAGGGGATTAAAAAAATGATAC GAGT CATTCACGAAT AT ATT 

TGCTTAACATCTAAAGAGATCCCTCAAAACACTAGAAAAAATAAGAACAA 

AAATCTAATAAAACAAAATTTGTTAAACACATTTAC CAAATTTTTTTTTT 

TGGTAAAAATTCAAATGTCATAAATAAAGCTAAAGTTCCTCTTGATGACT 

CGCTCCTCTGCCCTATTCCACTCCAAGTAACCACTATTATCAGTCTTGCC 

AATACCCTTCCAGACCTCTCTACCTCTATATACCATTAGAAGCACATGGT 

TTTGCATTGAGGATGTGCAGTGTTTTGTTTTACGTAAATGTTATCACTCT 

GTTCTTGTTCCATAATTTGCCTTTTTCTCTCAATGATTTGCTTGGCTATC 

TTTCTATTTCAGTAGCATCTCCTTTCTTTTTAACTTACCATTGTTTATTT 

AACCTTGCCTCTATCAACAGATATGTAGGTTGTTTCTAGTTGATTTCATT 

AAGTAT^ATAAACAACGCATCAGTAGATGTCCATAAATTTCTTTACGGA 

AGATGGCAAGTAGTGGAATTGCTGAGCCAAAGAACATGTTTAAAAAACCC 

AAAAAAACTAGACGCTACCAATTTTCTCTCCAAAATGGCCATAC C CACTT 

ACCCATACAGAGATGATTTGGAATCTGGCTTCCTCACAAGGTGAGATGCC 

-TCACAG" ! ~ r " T *CATTCTTCCTGGCATGTCTTCCCTTTTGTATCTGAGAGAG 

CTGGCAGAATTGTGTCACTAAATCAAGGATAGAGGGTCAAATGACAGCTC 
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AAGCTCACAGGCACCTCl'GCTTTCTTC^ 

CCAGCTCTGTTCCATCTTATAGAATGGTTGCCACTTGGGTGTCTGCTCCG 

ACAGCCATGTCATCCTTTGCACTGCAGTTATGAAGCAGACAGAGCTAGGA 

GAGGGGCTTTGCCAGCCTCTGCCCTAGCTTGGAGAACTTCAAAAAAGGAG 

GGTATTGAAGTTGAACTCCCCCAAAAAGGGGTGGTCCCCACACCTCAAAA 

AGTGGTGCCTCCGAAAGAAATGTAAAATTCGTGTGGGGGGGGAAAAAGGT 

TATTTAGAAATTGTTGGCTTGTCGTGCCGAAAGTATGTGTGGTTACGGGG 

AGTACGGAAATTTCGAGGGGTGGGGGCGAGGCCGTGTGTCCTTTAGCCCG 

GGGTTTTCCCGTCGCATGTTTAAGGGGGGGGAAGAGGGGGGATGTTTTCT 

TTCCGCGAAGGTTTTTGAAGAACGGCGTGG 

>Concig3 6 

CCCCCCACCGCCACTACTCAACCGGCCGTTCACGAAACAACTCGCCACAT 

CCACTAACCCGCTGGCTCACCACCCACCGCCCTCCCGATCCCCCCAATCC 

AAACTCAACCCCCACCACCAAGCGCCTCCCCCCTCCCCCACCCTCCAGCT 

CAGCCCCAACCTACCACCAACCCCGACTCGCCCACCGAAAACCAACAGCA 

AACCCAAATGCCCACAAAACCAGTGTCCAAACCCTCCTTCCCATCAGTTT 

GGTGGGCCCATCACCGCTTCCCCTGGCCCAGGCTCTCCTTTTGTGCGCTT 

GGAGCAGCAGACTGATCTCCCAGCCTTCACTCACTTCATGTGGTAATCTG 

TTGTGTTCATCACTGTCAGAATCTTCTGCATCCCCTCACTACTCTGCTGA 

AAACACTCTAGTGGTTCCTCATTGCTCATTAATGAAAGTCTAGATATTAA 

ACGTAGAAGGCCCAGCACAATTTGCCCCTATGCCACCTACCTCTCTAATC 

TTTTCTCCTTACTCTGACAGACTCTCCGTCTGTCATTTATGTATTCTTTT 

ATTGCTCTCTTCTACTTTTAGTATGAACTGGATTTATGGATTTTTTTAAC 

ATTGCTTTCAAGTATGGAATAAAGAATTTTATTTATTTATTTATTTATTT 

ATTTGAGACTGGGTCTCACTCTGTTGCCCAGGCCAGAATGCAATGGTGCA 

GTCATATCTCACTGTAACCTCGAATTCCTAGGCTCAAGCCATCCTCCTGC 

CTCAGCCTCCTAAGTAGCTATGACTACGGGTGTGCATCACCACATCTGGC 

TAATGGAATAAAATATTACAATGCCTAATCTTAATTTTCAAAATTTTAAA 

TTACATTGTACCTAATGCCCATGCATTTACTTTTTTCAGTGGGTCAATAG 

CCCTCACTTTGGCAAAGGTCCCAGGCCCAAGGTAAGGCCTTACTTTTTCC 

AAACTCATCTTTTGAAAGACATAAGTGCCTGTAAGTTGTACCACATTAGG 

TTCTAGGAATTTTTCATCAAAGACTTTATCAGACTATTTTCCTCTAAGTT 

GAGAAAGAGCTGGGGGCAGAATATGGCACTGAATGACTGAAGAGAAGGCA 

CTGAAATCAGGCCAGAGGTTGCTGGAAAGAGCAATGAGGAACACCAGCAG 

CAATGAGGAGCCGGTGATGATTTTGGCTTCACAGGGAGGTGTGTACCACA 

CCGATTTTATCTCTACGTGGATGAACCACAGCTGTCGGCTCCCTTGTCTC 

CAGGACATCACACTCTCCACATTCCCTCCCATCTTCCGGCTTCTGCTTCC 

CGGGGCCCTCATCTGCCCCATCCTGGGTGAACACTGGTCGGTCAACTGCT 

GGGCGTACCTTCCCGCTCTGCACACCCTCCCTGGCCACCCCACCCACTCT 

CACGGCTCGCACTGCAGAGGAGCCGCATCTCTAGCTCCAGCCCATCTGCC 

TCTTCTGAGCTCTAACTTCATGTAGGCGACTCCTGCCGGTGTTGCCTCAC 

AGGCCCATCATACTTCAAAGCATTTTCCCCTCAGAACACCATGTCCTGGC 

TGCTCCCTCCAGAAGATACATCTCTCAAGCACATCCCCGCGGCTCTCACC 

TGGATGACTGCATTCACCTTCTCCCACATTTGCCCCCTCCTTTGGATGTA 

TATAGATTGTTTTAAAATACAAATCTGATGTGCTTGCTCTCCTGCTTGAA 

ACACCTCAAAACTGCCTTCAGGATAAACCACTGCCCTTGACATGTTCACA 

GGTTGCCCATGGCCTGGCCCTGCCCATCTCTTCAGCCTCATCTCATGCCC 

CTTGCCCCTCGCTCTCTGGGCTTCTGCCTCCCTAGCCCTCCTTTAGGTTC 

TCTAACACACCATAGTCCTTCTAGTGTTGGGGCCTCTGCAAGTGCTGTTC 

CCATTGCCTGAGACATGAATCCCTCTCCCTATCTCTACCTGCACCTTCAT 

CTGATTAATCCCTACCCTTCCTACTCATGATGTTGCTTTCTCAGGGACTC 

TCTCTGACTTTTTAAACTAATCAGGGTCTCCCCAGTATATATCTTCATAG 

CACTCTGTATTACTCCTTTCTTAATGACCACCTGCTGTAGACAGAATGTT 

TGTCTTCCTCCAAAATCATATGTAAAACCTTCCACCAGAGCGATGATTAG 

AGAAGCCTCCC 

>Concig37 

GACTGACATTCAGAAGATATTAATAAGAGCACTAATGATGGGGATTGCAA 
CCATGTCTTTACTGACTTCCAGAAGCTTCTTACAGTAAACATGAAATCAC 
ATAATTTCTTCCACTTTCCTACTGTTTCTTGTTCTGGGCTCTGTCCTGCT 
TACTGTCTAATATCTTGGCCCCTTAAAAGTTGCTAATCTTCCAAACCTCA 
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TTCCTGTGACTGGGCCL, ^TGGTCCTTG'i. * sZATGGGCCTTGAAGATACl 

CTGTACACTTATCTGGAGCATCCAGTGCCTACCACCTGACCCAGATTCCT 

CATTGCGCTCCTCCCTCCTCCACCTAATGGGATTTGCTCATACCCGTGTG 

GGACCCCTCCCATTTTCCCCAACTGAATACTTATCAAGACAACGCATTGC 

CATACTCCCTCGTACCCTGCTCTGGGCATCAGACTGAATGTTTGTTTCCA 

TTGAGGATCTGCAGCTGCATCAGTTTCCCCAGCACCGTCCAACCCCTTGA 

GCATGGCTAGTCCTAAAGCAGAGAATTAGCCTTTCTATCCCTGCTGCTAT 

ACATGCTGGGACAAATAATAAGAAATGACAGCATTTTATGATAATGCAGG 

CTGCAGGAGGCAGGAGGCAGGAATCAAATTCGTGCTTATCAAATAGTGCT 

CCAATTCTTTGAATATTGGACTATAGAATATGTCATGGATCTATGCTCAG 

GTGGGTTCCCTATTACTCACTCCACTGAGGCCAGGTTGTGGGATTAGCTG 

TCCAAGAGGGAGTTTCAGTCTCACAGCATAGGGTCATTCTGAGAATTACT 

GGCCCACACTTGTGTGGAGACCTCCAGAGAACAGAATCTGGGTTGGTGCC 

ATGTACTTCCAGGAGGAGAGAAGTGGCAGGATGCCCAGCCCCACAATCAG 

AGGGGAAGGGGCAGAGCCACATGTATGAAGATCCTCTCCCCAGTACGTGC 

CAATCACAGGGCTTCCTAGCTTTTGGGCCAAGGAAACAATGTGGGAAGCA 

AAAAAGGACAATTTTCTCCTCCCTTTGCATGAAGACTGAGCAGTTTTACC 

AGATTCCCAGGGAAACACCCTTCCACTCTGGGTTGAATGTGAGTGAGAGA 

CATTCAGCTGGAACACTAGAAAAACTATTTCCTGAGCCACTCACCTTTAG 

CCCTAGAAAGTGTTGGATTTGTCCTTCATCTTTGCCACAGTAGAGACTGC 

TGATAGCATCAGAACTTGGGCTCTGGAATTAGACAGATATGGGTACAAAT 

CTGAGCTCTCTCACTTATTAGTGTGGGATGTAGAGCAACTTTTAAAATCC 

TTCCAAACCTCAGACTTCTCATGCATGATGTGAGGATTGTAATAGGGCCC 

ACCTAATAGGGGTTTTTGAGAATTAAAAAAGTTATTCAATGAACAGCATT 

TAGCAAGATGCCTGACCATTGAGAAAATAACAAATTGTTTATTATTATTG 

TTATTATTAAACATCTTTCCTGCACCTTCTGACTGGGGGCATCGTATCAT 

CAGAAATACTTAGGATGGGATGGATTCCTGCATGGGCTGAGTCAAGGGTG 

CAATAATGGAGGAGTGAAGAAGGAAGAAATGGAGGCAGAAATCCCCAGGA 

GCCCAGCATGGTACAAGGCTGAGCTAGTGCTGCAGAGCCTCCTTGGAACA 

GCCACAGAGCTTGCATCTGGCCCTGGAGGAACCTCTTCTAGCTGGCAGGA 

CCAGCCACAACAGTGGCCAGGGGATTTCCCAGGGCGTGGGCTCCTCAGGA 

GTTCATTTGGACCAAGCCTGCCTGGAGAGGGGTTATAACAGGGATCCTTC 

CCTACTGGCAGGTGATTTACCCCTCGGTGAGAAGCTCAGGCATTTGTTTG 

ATGGAAGGTGGAAGGCCCTGTGCTGGGCCAGTGACTATCAGGGATGGGCG 

GGTGGCTGGAAAATAGCAAATAAGACAATATGATAACACAGTTAACCACC 

ACACTATGTGAAGCTACAATATGGGTATCTGTAATAGACAATTCCAATGT 

AGAGAATAATTTTAAGGTGTCATTCTCCCCGCCAATGCCATAAGCACACG 

GCCTCTGCCTGGGTTTCTCACTGTGGAATGTCCTCCTGGTCTCCTCATGC 

CCAGAGAG7GGGAAGTACTCCTACTTTAACACCGGCTTTCCTGTCATTTC 

CNTGCAGCCCTCCTCAGCCCCCTCTGCACAGGGAGGTTTCCTCCCTGCTG 

CTGCAGTGCTTTGTACTTGTTAGTGGTACCTGCACACAGGTATTGGTGTC 

CTTGTCTCACCACCCTACATCACTGTAAGCTCCCCAGGAGCAGGCTTCCT 

GTTTGACTCACCTGTGATCCTCCACCTCCCACCCTGTAGTGCCTCAAGCA 

TTCTGTAGAGCACATGGACGCC 

>Contig38 

GACTAATAAGTACTTCATTATTTGGGTATTTTCCAAGAACAACATATTGT 

AGGAAACCATTCTTTCTAAAAAAAAAAGTGTCCTTTTAAAAAGGTGAATA 

ATTTTTGTCTAATTCAAAGTTTATTGAAAAGTTATGTATAAAACAAGGTA 

AAAGGAACAAGGAAATAAGGGAAATGTAAAGAAAATTATAGAAATAAAGT 

GGTATTTTTTGGTAAGAAAGCTTAAAGAGAAATAATTTTAGGTAAGAAAG 

AATCTTACCTAAAATTTTGTGCTAGAATAAAGTGACTGGCTAAGAAAGGG 

ATGTTCAAAGCTATTTATGACAAACCCACAGCCAATATCATACTGAATGG 

GCAAAAGCTGGAAACATTCCCTTTGAGAACTGGCACAAGACAAGGATGTC 

CTCTCTCACCACTCCTATTCAACATAGTATCGGAAGTTCTGGCCAGGGCA 

ATCAAGCAAGAGAAAGAAATAAAGGGTATTCAAATAGGAAGAGAGGAAGT 

CAAATTTTCTCCGTTTGCAGATGCATGATTGCATATTTAGAAAACCCCAT 

CATTTCAGCCCCAAAACTCCTTAAGCTGATAAGCAACTTCAGCAAAGTCT 

CAGGATACAAAATCAATGTGCAAAAATCACAGGCATTCCTATACACCAAT 

AATAGACTAACAGAGAGCCAAATCATGAGTGAACTCCCATTCACAATTGC 

TACAAAGAGAATAAAATACCTGGGAATACAACTTACAATGGACATGAAAG 
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ACCTTTTCAGGGTGAAw - GCAAACCAC^ ^CTCAAGGAAATAAGAGAGWlA 

ACAAACAAATGGAAAAACATTCCATGCTTATGGATAGGAAGAATCAATAT 

CGTGAAAATGGCCATACTGCCCAAGTAATTTATAGATTCAATGCTATCCC 

CATCAAGCTACCATTGACTTTCTTCACAGAATTAGAAAAAACTAATAGCC 

AAGACAATCCTAAGCAAAAAGAACAAAGCTGGAGGCATTG7GCTACCTGA 

CTTCAAACTATACTACAAGGCTGCAGTAACCAAAACAGCATGGTACTGGT 

ACCAAAACAGATATATAGACCAAAAGAACAGAACAGAGGCCTCAGATATA 

ACACCACACATCTACAACCATCTGATCTTTGACAAACCTAACAAAAATAA 

GCAATGGGGAAAATAATTCCCTATTTAATAAATGATGTTGGGAAAACTGG 

TTAGCCATATGCTGAAAACTGAAACTGGACCCCTTCCTTACAACTTATAC 

AAAAATCAACTCAAGATGGATTAAAGATTTAAACATGGCTGGGCATGGTG 

GCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGATGGGTGGATCAT 

GAGGTCAGGAGATGGAGACCATCCTGACTAACACAGTGAAACCCTGTCTC 

TACTAAAAAATACAAAAAATTAGCTGGGCATGGTGGTGGGCGCCTGTAAT 

C C CAGCT ACTTGGGAAGCTAAGGCAGGAGAATGGTGTGAAC C CAGGAAGT 

GGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTCTAGCCTGGGCAAC 

AGAGTGAGACTCCATCTCAATAAATAAATAAATATGGAACTCTCCCAACA 

CAATAATAAGACAAACCCCCAAATGTTTTAAATGGGCAAAAATATTTGAA 

CAGACACTTCACAAAAGAGGATATGTAAATGGTCAAAAAGCACATGAAAA 

GATGTTCAAC AC CATTGGTCAT C AGGGC AAAGAAAACTAGAAC CACAATG 

AGATGCCTCTGTACACCACTTAAATGTCCAAATTAAAGAAAACAAGTTTT 

GGCAAAGTTGTGGAGCAACTGAAATGCTCGTGTATTGCTGGTAGAAAAAC 

AAAATGGCATAACCATCGCAGATAATTTGTTGTCAGTTTCTTACAAAGTT 

AAACATATACTTATTGATATGACAGTTCCATTCCAAGAGAAATGAAAACA 

TAAG7CCACACAAAGACTTGTACCTGGGTGTTCATGGTAGCTCTATTCAT 

AATTG C CAAAAT CTGG AAACAAATCAAATGTC CAT CAGCAATGGAATGGA 

TATACAAATTGTGGTACACATGTACAATAGAAAACTACTCTGCAATGGAG 

AGAAATTAACCATTGACAAACACAAAAACATGGACAAACCTCAAAAACAT 

TATGCTGAGCAAAAGAAGCCAGACACAAAAGACTGCTCAGCGCATGATTC 

CATTCATATGAAATCACAGAAAGGGTCAGTTGAAGGTGCAGAGACAAAAA 

GTAGATCTGCAGTTGCCTGGGGATGGGGTGGGAGGTTGACTGCTCTGACG 

CGTAAGGAAATTTGGGGGTAGGTGGGGGATGGTGGGAATATTTTTTGAAT 

TGAATTGGGTAATAGTTTTAATAGGTAAAATATTGGACCCCACAGTATTT 

GAGATAGGTTTCAGTCAATTTAGACAGTTTATTTTGCCAAGGTTAAGGAT 

GCATCCGTGACCCAGCCTCAGGAGGTCCTGACAACCTGTGCTGAAGGCAG 

TCAACATACAGCTTGCTTTTATTCATCTTAGGGAGACATAATACATCAAT 

CAATGCATGTAAGGTTTACATTGGTTCAATCTGGAAAGGTGAGGGAACTT 

GAAGCAGGGAGCTTCCAGGTTACAAGGTAGATTATTCTCAACAGAAAGGA 

ATGTCTGGGTTATGATAAGCGGTTGTGGAGACCAAGGTTTTATCTTGTAG 

ATGAAGCCTCCGGGTAGCAAGCTTCAGAGGGAATAGATTGTCAAAGTTTC 

CTATCAGACATAAGGTCTGTGTTGATGTTAATGCTGGTCAGCTTTTCCTG 

AATTCCAAAAGGGAGAAGGGTATACTGGGGCATGTCCAACCTTCCCTTCC 

ATCATGACCTGAACTAGTTTTTTCAGGTTAACTTTGGAATGCTCTTGGCC 

AAGAAGAGGGGTCCATTCAGATGGTTGGGGGGGCTTAGAATTTTATTTTT 

GGTTTACAGTGAAGACTTTTCAAGCTAGAC^CTTAAATGAGTATGTTGCA 

AAATGGCAATTTCTTAGCACGGC 

>Concig3 9 

GACGTCCTAAAGAAATGCTAAGGTAACTCAATTAACTATGCTAGAAAAGA 
GAGTTAAGTATTTAGGAGGATTTAATATGGTGTTAAAGTTGTGAAAATCA 
AAATGGAGACACTAATGTTAAGAAAACCCTGATAAATGGAGCCAGGGAAG 
GCCATGAAGAAAGAGTTCTCACACTTGTATCCCTGATCATGAAAAAGACT 
CTGCAAAAAACAAAACCTTGCACAAAGGCCATTGCAACCTTACACAAAAA 
ATACTACTTTAAAAGGACATGTGCCCAGCAACTGCCTGTCCAACCTCAGA 
CTGGCAATATCTTTGTTATTGATCTTAGTAGCCCAGCATAACTATTTCAA 
AACAGTGATGTAATGCTCATTTTTTTTCTTTTGAAAACTTTTGTCTTCCT 
GTAAAAACCTTTGTCTTCTTTACTTACCCTGAATATGCACAGAGTTTACT 
ATGGAGTGCATATTCCTGTTGCAATGCTCTATTCCCAAACAAACATCATT 
TTCTTTTAGAGAGCCTCTCTCTGTTTGTGATTTAGGTTGGTGATGTAAAG 
CAATGGCATAACTGAACACTGATTCAAAGAAAAGTGGCTTTTCTCTTTGT 
TGTATTAAAAAGAGGCCTTATAAATAGGATAGTAAGATTTGTAAGTTGAA 
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CTTAAAGCATGAAGAAAATTTAGGGGCCAGGCAGGGTGGCTCACSCCTGT' 

AATCCCAGCACTTTGGGAGGCCAAGACAGGAGGATTGCTTGAGCCCAGGA 

GTTCAAGACCAGTCTGGTCAACACAGACCTCATCTTTACTAAAAATAAAA 

AAATTAGGCCAGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 

GGCCAAGGCGGGAGGATCACTTGAGGTCAGGAGTTCGTGACCAGCCTGGT 

CAACACGATGAAACCCCATCTCTACTAAAAATACAAAAAAATTAGCTGGG 

TGTGGTGGCGGGCACCTGCAATCCCAGCTACTCGGGAGGCTTCAGGCAGG 

GGAATCACTTGAACCTGGGAGGCGGACATTGCAGTGAGCTGAGATAGTCC 

CACTGCACTCCAGCCTGGGCGACTCAGCAAGACTCTGCCTCAAAAAAAAA. 

AAAAAAAATTAGTCAGGTGTGGTAGCACACAGCTGTGGTCCCAGCTACTC 

GGGAGGCTGAGGTGGGAGGATCATCTGAGCCCAGGAGGTCAAGGCTGCGG 

TAAGAGCTGAGATTGTACTACTGCATTCCAGCAGGGGCTACAAAGTGAGA 

CCCTGTCTCAAAAAAAGAAAAAGAAAAAGAAAATTATGTTTTTAAATTTA 

TAATTATAATAAATTTAATTACATAAATTTAAGCTCAAGTAATTGTAAAT 

ATTCTTTCTGTGCACATAAGTTATTCTTGTATTGACCCCACAGGAGCTGG 

CCATTCTTCAAGTCAGAAGGCCTGAGAGAGGAGCTGCCCAGSTGGTCTTC 

ATGGGGCTGTGCGGCCAGTCATCCCCCACAGGTTGACAATCCTTGTGTAC 

TTCATCCTCGTTGGATCCTCTGTATCCCTGACGATGAGCAACTGTGAGGC 

CCGTTTCAGCACTGAGTTCCAGTCAGGAAAACATCCACCCACCCACCACA 

CGCTCACACTTACACACACATTCACACATGCACACACGTTCTGGCTCCGA 

AAAAGAAAAAAAAAAAGCAATTTAAAATAATTCTGATCCTTTGC'TTATTT 

CCACAAACTCCATGAAAATTGTACATTGTCCAAGCAACATTTCTTAATAT 

-^CTCTTTTTCTCTCATATCCATTTTCCTTACTGCTGTCTCCACCTTTCTC 

TTCCAAACTCCCTGTTAAAATCCCTGCCCCAGCGAACTTTTATTCAATTT 

TGTGGAATGGAGGCTGCTCTGATTTAAATTAAAAAAAAAAAAAAAATCCC 

TACTCCATGTCCCAGATCCCTAGTTGTTTTTTGTTTTTTGTTTTCCTGAG 

ACAGGGTCTTGTGTCTTCCATGCTGGAGTGCAGTGGCATGATCATGGCTC 

ACTGCAGCCTCAACCTCCTGGGCTCAAGTAAATCTCTTGCGTCAGCCCTC 

CCCAGTAGCTGGGAGTTCAGGTATGTGCTACCATGCCTAGCTAATTTTTT 

TCTTTTATTTTGTAGAGACACGGTCTTGCCAGGTTGCCCAGGCTGGTATA 

GAACCCCTGGGCTTAAGTGATCCTCCTGCCTCGGCTTCCCAAAGTGCTGG 

GATTACAAGTGTGAGGCACTGCACCCAGGCTGGATCCCTGCATTTTTACA 

GATTTAGCATCACAAAAGTCTAAACAATTAGACTGACTAAGGCAGAACTG 

CCCTTATGACAGCAGACATAAGAAGGAAAAGGCCAAAACACTGTGTTAAA 

AATTATCCAAATGTGAGGAAAAGGCAAAGAGAGTAGGTGTGCCTTTTTAG 

TGTCTAAGCTGCCTGCCCAAGGGGCATCTGATGCTCTCAGGCAGGAGTCC 

ACAAATTTTTTTTTGTAAAAGATCAGATAGTAAATCTTTTCAGCGTGAAG 

AGCATGAGGTCTCTGTCACAAATACTCAACCACCATTACAACATGAAAGC 

^GCCAACAGACAACACATGACAAATGAGTGTGGCTGTGTTCCAGTAAATC 

TTGATTACAAAAACAGGCAAGAGGCCAGAGCTGACCCATGGGCCATAGTT 

TGCTGACCCCTTCTGTAAAGGAAAGTATTTTTGTTTGACTTGCTGTTTAC 

CATTGATTGAACACAAGGCTCTGTAGAGTTACTTGTTAACTTGCAGAAGA 

TTGATGAGTGGCAAGTAATTTTTATTCACCAGAATATANNATTATTCTGT 

TCAGTAGATAAGATAAACCCACTGTTATATTACTGTCTTGTTTAGAATGT 

GACTTTGATTCATTTTTTCACAAATTCATATTATTGCCCTAATTTGTATA 

TAAGTATGCTTCTTTTAAAAATATATATTTTTTAATAAATTTGAGACAGG 

GTCTCACTAGGTTGCCCAGCCTTTTGCTATAATGAGAGCATAAAGTGAAT 

TTCACACTTTAGCCTAGTGCATAGATGGGATTACAGGCACAAACCACTGC 

ATGCAGCTAACTTTGCTTCTCATTCCAGCACGTTCTATTCCNNNGNTTTT 

CATATACGCGTCTCTTAATGC 

>Contig40 

CGCATTCAGCCCAAGTTTTCTTCAGTGTTAAGGTTTTTGTTACTCTGTGC 
CCAAATGTCCTTCCAAAAAGGTTAAGTTTTTTTACCTTCCTGCCAACATT 
ATATGAAAGTGTCCACTTTTGTAGACTTTTACCAATGCTGACTACTTTTG 
GTTTCAAAAAAGCTCTCAGTAATTTTCTATTAATTACTTTTACCCTTTTT 
TATTGAGGGTGTTCAACTTTTTATTGTTAGCATATTCTCTCTGGGCTCCA 
TTGGACGCCTTGGCAGCTTTTTGGTAGTAGGTGCCTTTAGAAAAGTCCTT 
CTCGTCTGGCCCTTTCTGAGCAAATCTAGTGAACAGAATTGGCTCCATGC 
^ CAGCATTG CTTAATACGGTTGATC CAGGGC CTAGGACT CATTC CTTCAT 
T AC CAT C CACTTGCATTGTCTTAAAGCAAGGCT CTATTAATTTAATTTG G 
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' CATTTCCTGTCCCAGCTCl-rTAGTT^ 

TCCCAGTAGATGCCTATGTTGCTTCCTTTTAAAAAATTTTGGAGCTGTTT 

CCCTAGCCTAACCTTTTCTTCAGGGCAGGAGTTAAGTCCCTTCTACTGCA 

TTCCTGTGAAGATGGTGATTCAAGAGGCAGGGCACCTGTTGCTTTGTGAA 

ACAGTCCACTCTGCAGCTGGGCAGCTCTGTTACTAGAATGTTCTCCCTTC 

TGGGGAGCCAATATTTTGATGTCCTCTGTGAATCTCATCTGCTTATCCCA 

TCTGTTTATGTCCTTGAAGATGCACAGGTCTGACACCACGAGGTAGCCCT 

TAGAAATTTGATGGCATTTCTGATGTGTCCCCAACTCTTCTCCAACCACT 

CCTCCCAGAGCTTGTTTCTTAAGCCCCTTGTGGAGCTGATTGCTTTCCTC 

AAGGCAGCTCAGTTTTTCCCAGTTTGCTCCTGGTGGTCCTGAAATATGAT 

TGACTCCTGAATACTCCAGGTGTGAAGGAGAGTGGGGGTGGCCTTTCTAC 

TTGTCATGGCCTGGGTTTTAAGTTGCTGTCCAGTGGAGCAGAGGTGACTT 

TCCCAGTGAACTACATTTTTTCCCCTCTAAATCCTTAGCAATTTTGTCTC 

CAGAGGCAAGACCTGGCCAAACCATTTGTGTTGAGGATTGAATCAAGAAT 

GATTGAGGAGATGACAGTAGTCCCCCCTCATCTGAGGAGGGCGTGTTCCA 

AGCCCCTCAGTGAATGCCTGAAACTGTGGATAGTACCCAACTCTATATGT 

CTATGATTTTCCTATAAATTAATACATGCCTGTGACAATGTTTAATTTAT 

AAATTAGGCAAAGAGGCCAGGCGCAGTGGCTCAAGCCTGTAATCCCAGCA 

CTTTAGGAGGCTGAGGCCTCACCTGAGGTCAGGAGTTCGAGACCAGCCTG 

ACCAACATGGAGAAACCCCGCCTCTACTAAAAATACAAAATTAGCTGGGC 

ATGGTGGCAGGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAG 

AATCACTTGAACCCGGGAGGCGGGATTTGCGGTGAGCTGAGATCGTCTCA 

TTGCACTACAGCCTGGGCAACAAGAGTGAAACTCCGACTCAAAAAAAAAA 

AAAAAAATTAGGCAAAGAAAGAAATTAACAACAATAAGTAATGAAATAGA 

ACAATTCTAACAATATACTATAATAAAAGTTGTATGAATGTGGTCTCTTT 

CTCAAAATTACCTTTTTTTTTTGAGACAGGGTCTCACTTTATTGCCCAGG 

CTGGAGTGCAGTGGCACGATCACAGCTTACTGCTGCCTCGACCTCCTGGG 

ACCAAGTGATCCTCCCACTTTAGCCTCCTGAGTAGCTGGGACCACAGGCA 

AGGTCTCACTATGTTTCCCAGGCTGGTTTTGAATGCCTGGGCCCAAGGGA 

TCCTCCTGCCTTGGCCTCCCAAAGTATTGGGATTACAAGCGTGAGCCACC 

ATGCCTGCCCCAAAATTATCTTATTGTTCTATACCCACTCTTCTTCTTGT 

GATGATGTGAGGTGATCCATTGCCTCCTTGATGAGATGAAGTGAGGTGAC 

TGATGTGGGCATAGTGATGCAGTGTTTAGGCTGATATTGGCCTGATGATA 

TGTCAGAAGGAGGGTCATCTGCTTCGGTGATCCTGGATCATAGAGTCATG 

ATGATGTCAATGGTTGGATGTCAGGAGCAGACGATGTCAATGACTAACGA 

TAAGCTGGACAGGTGGGATGGTGGCACAAGATTTTATCACGCTACTCAGA 

ATGGAGCACAATTTAAAACTTCTGAATTGTTTATTTTTGGAATTTTTCAT 

TAATATTTTTGGATTGCAGTTGACTGTGGGTAACTGAAACTGTGGAATGT 

GAGACTGTGGAAAAGTGAGGGAGTACTGTATTATGGAACTGTAACTCTAT 

TCGGTAGGGGAACAGAATTCACATTTGtGGGGCCCAGGTCTCTGCATCTG 

TAGGGATCCAATTGTTTCATTTCTCGTTGTAGCAAAAACTTGGCTTTGGA 

ATCAGACAGATTGATGTTTGCTATCATTCTAAATGGGTGCAGCTACACTT 

TCCTCAAGAGGTAGTTCTGAAAATTTAACAAAATGTGAATTTCTTGGTAA 

AAAAAAAAAACCTCAAAAATATTCAGTTTCCTTTCCTTTGTGTCTGATGT 

ACTCCATCAAATACTGGGAAATATGTGTCTCTCATAGAAATGTCATGGAT 

CTTTGTAATTCTGATTATCCACAAACCTTGGGGATTAGCTGTTTCAATGT 

TCCTATTTTACAGATAAGAAAATGGAGCCTGTGGTAAGTTAAGTGAGTTA 

CTCATGGCTACTTAACTAATATTTTACTAGGTGATAGGCCAGAGCTAGAG 

CCCAGGTCACCTTCTTATCAATGCTCTGCCTTGTCTCTGTGCCTTCCTGT 

CTGTCTGTATGTGTATGTGCCTGTTGACAGTAAGGCATAGTTTAACCCAG 

TAGAACTACCGGTTTGTAATGAATTCCACTTGTAAATGACTGACCATTCA 

AGGAACAAGTGTTTTTTCTATGCTTGACACCTGTTTTGGATGCCAAAAAG 

GATACAAATGTAACTTCAGACACTCTGGGCCTCATTTTGCACTCATTAGC 

ATGTCCAAAATTAAAAAGACTGACCACACCAAATATTGGTGAGGATGTGG 

AAGAACGGGAACTTTCATACACTGCTGGTGGGGATGTAAAATGGTACAAT 

C C CTTTGGGTAACAGTTTGACAGTTTCTTAAAAAGTTAGACAT ATATATT 

TACCATATGACTCAGCCCTTCCACTTCTAGGTCTTTACCCAAGAGAAATG 

AAATG CTGTGCTTTTACAAATGTCTATACAGGAATGTACATAGCAAC CTT 

ATTT GTCATTGCAAAAAACAGAGACAATT CAACGTTGT CAAGAGTG AATG 

FIG. 4 (18 of 61) 

/// r 

'JSDOCID: <WO SG06426AV L> 



WO 99/06426 PCTYUS98/16102 

GATGAGCAAGCTGTGGTX ^TCTATGCA^GGTATCCTACTCAGCCAG, 

AAAGATATGGCTAAT 

>Concig41 

3ACAACAATGTCATGCATAAGATGACGATGGCCTGGGTGATTGATGCAAA 




"ATAGCACACACACTTACAAAAACAATACACAGACTCCTGGCCAATGGAC 
TTCAAAACTGAGGAGGATCATTAAATTTAAATGTTCACCGCTGCATGAAA 
-""TCCCTGGGTCCTGCCCTCCCTTCCCCACCCTCCTCCACTTGGGCCGGG 
GCACAGCAGTGATTCTCTCACCTCTCAGAGTGAGCCAGTGTTGGCTGCAT 
" r GAAGGCTCCAGATATGCAAACAGGGCAGATATTCCTGGACCAGGGTGCA 
CAGAGTGAGGCTCCAACGCACCCTATTAACTGCATGAAGGATGAATGAGC 
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CATGATTTAGCCACATTCACCAATAGGATAATCTGGAGAATTTCCTAGCT 
TGAGTTTCTGGGAGAAAGCAGATTTCTGGATTATCTGGTGACAGGTAACA 
GGGCCGAGTTCATCCACAGCCACCTGCAGTGTTAGCACCTTAAGCTGAGT 
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TTGCTCC r GTAAGTCAGGGCTCATTGTGGCTCCTTGCAGGCTTGACTTCA 
GGGTTAACAGAGAAAATGAAGGTACAAGTGCCTTGTGAACTCTGAAACTC 
CAAACCAGTCATTCTCAAAGTGCCGTCCACCAGTCTAGCACATCAGCATC 
ACTGGAAGCTTGTTTGAAATGTAAATTATCAGGTCCTCCAGAGCTATGTA 
tgAATTAGAAACTCTGGGAATGGGGCCCTGCAATCTATTTCAACAGGTCC 
-"CCAGGTGATTCTGATGCAAGTTAAAGCCTGAGAAACTCTGTCCTATACA 
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CCTGGTGATCCTGAGATGGGGTTTGGGGGACAGAGATCCAAGGTGCTACC 
AGGTGTGAGGAATTGTTAGAAGGCAAACCTGGCTGTCATCTAGGGTGCTT 
/-.<» -« -r-nfPTi/v!* TTrrnrrTrTTaraRrTnAiTCAGACTTTC 




C AGAATGGG ATTii v_ 1 vj 1 v_ v_aa i uoi-a i.vjv_v_ j.v__i vj\jvj j. v_«w * w * w~ * «- » 
ATAGCCTGGGCTGGGAACCACCAGAGGATTATCTTCCATTGACCAAGCTG 
ACAAACTCGCTTAAGGCTCTGAGTTTCACACTTGATTTTCTAGCCCCTGT 
CCTTCCATGGATCACCTGCCCCCTTCCCTCCTAATCAGGAGCACAGTCAG 




^^GACATGTGTTvjVjvj i Av_ 1 i v_v_ ivjvjvj j. v_ j. uuu«v. j.*»vjvj>- 
^TTTAGGAGTTGACTCAAATGCTCCCTGTCATAATTATGTGAAAAAATAT 
AATTATTAGCTCCATGGTACAAATTAAGGAGAGGTTACATAAATAAAAAG 
-.AATGATACTCAAATTAGTAACCAGAGCCCATGCTCTTAAACACTATGCT 
.„—„-,- - .^-~t> * /-»■» tti r^nTnrzr & _ & & rrr f*a & AfViCTAGATTGAC 




"CTGTC CACTTv_v_Avjv_ UAAvjA 1 Vj.f—AVj i Aunnvin x _ w*\_n. ' — 

CGCATTAAACAACTTAGGAATCAGACAAAATATACAAAGCATTGTTTGTT 

ACACATTGGATAACAGACAGCACTAGATAGTCGTGTCTGAGAAAAGCGGT 

GAAATGAGCTGAGTCTTAGAATTGCCCCAGTTTACTAAGGGGCATAGTAA 

GGGCATAGCTGCAGCACAAAGAAGCAGAACCCAACAGAGACTGGCGTTCA 

CCTGAGTTGAGAAAACCAAGTTGAAAATTTAGGAACACTAACACAGATAT 

GTAGGCAAGAGTATCAGAGAGGAGACAGTTGTAGGGAAAAAGAGAGCTTT 

ACAGAGAGACAGCGAGAGCTCCAGAGACCCGCAGAAGATTGCCCTGACGT 

CACTAGCTGAGTACCGATCAGTGCATACATGTAAGGATATTACTCAATAT 

GTGGAAAAGAACAGAAGGAATGATGTCCAAAGCTCACCCAAAGACAGGAA 

TCATTTATGTTTCCACCAGCCAGAGTGGAACAACCTTGTAACGCATATGG 

, * 7v r^n.t, ATiTTTrrTr &RTAATAAGTTCAAATTAACTGAGACT 




TTAAATTCAAAATGTCTGGCATCCATTAAAAAATTACCAGCCTTGAAAAT 
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tggcgggaaaatatta; -jataatgaa,. .gaaaaagcaatcaagaga/ 

aggcctagaaagtatacatatgataaaattagcagacattaaatggttat 

gattaatttattttatatgttaaagaaggtagagaagagcataagcacat 

taaagagagacaggaaagtcccagtactcacacagggccaggagcagttt 

tcaccagtcaggtgggaaaacttcatatttcatggagcattggtagagta 

cacagtgtcttgccttagtagagggataaatgctgttctgttcccgccta 

acccatcttgaaagaaaatctgaaaggatcaaactgtattcaagtaacct 

aa^cacatcccagcacacagctcgactagttataaaaacacaaaatatta 

atatctagaaacacaaaaataatatctagcacccaacaaggtaaaattca 

caatgtctagcattcaattgaaattttctaggccatcaaagaagcagtaa 

aatatgacctataaggccgggcacattggctcatgcctgtaatcccagca 

ctctgggaggccaaggtgggtggctcacccggaggtcaggagttcaagac 

cagcctggtcaacatggtgagacctcatctctactaaaaatataaaaatt 

agcccagcatggtggtgggcgcctgtaatcccagctactcaggaggttga 

ggcaggagaatcgcttgaacctgggagaaggagaccgcagtgagccaaga 

tggcaccaatgcactgcagcctcattagagaacatcgggaag 

>Contig42 

gaaactaaaggcttatttaaagcgcgagaccgtggcgcctttggactgga 
ccctttctaatgatcatttagtatcaggctatgtgggagttgaccgtttt 
gcatagcctgaaagccaacagtatcactcctcctctaggtgtggcagaga 
tgtgagagaaggagactgacagtctgtgggtgtgtatgcagtgtt.ggggg 
aagcgaggcacaggggacaatactgtggtgtagaaaactagtctaaggta 
gcatcaggaaattcatgaaaccaaaatgaatttcataacagcacaagaca 

TTATTTGTTTTTGCCTCCCTCTCATTTTTTTTTTTTTTTGAAACAGAGTC 

TTGCTCTGTCATCCATGCTCGTGTGCAGTGGTGCAATCTCGGCTCACTGC 

AACCTCCACCTCCAGGGTTCAAGCAATTCTCATGCCTCAGCCTCCTGAGT 

AGCTGATTACAGGTCTGCACCACCCCGCCGGCTAGTTTTTGTATTTTTAG 

TAGAGATGGGGTTTTGTAATGTTGGCCAGGCTGCCCTGTCATTTTTTTTT 

TACTAGTGTCCAGTGGAGTTTTTTAGGGGCTACATAACATGATACTGTCA 

TTAATCTAATGGCTAATGAAAGGGATATGTATATGTTTTTGTGTTTAAAA 

CAAACTTCTTTGGGGTCCTCAATAATTTTTAAGAGTATAAAGGGGTCCTG 

AGATCAAAGAGTTTGAGTTCTGCTGGACTGGGACAGTGGTTGTCAACCCA 

GATTGTACATTAGGGTCATCTGGGAAGCTTTAAAATAGTACTGATGCCCA 

ACCTTACCGCAAACCAATTAAGCCAGAATCTCTGTGGATGAGAAGTCTTC 

ATTGTCATCATCACCATGACCATCATCATTGTCACCGTCACTACACCATT 

ATCATCATCATCATATCATCTTCATTATCATTGTTAGTATCTCCATCACC 

ATCATCAGCATCACCATTATTATCATCATCATCATCCCCACCATCATCCT 

CATCGGAACTTCACCTGCATGGAGGACAATCCACTATGCATTAGGTGCTA 

TGCTATTTGCTATACTCCTTATTCTCACAACTGCCCAGAGAGGCTGATAT 

TATCTCACTTTATAACAGGAGGAATCTGGATCGGAAAAGTTAAGGTAAGC 

TAATTCACAGAGCGAGAAGAGATAGAGCCAGGATTCGAAACCAGTTCTCT 

GCT AC AT CAATGTTC C CAGT C CTTGCACTATTGAGAAC CT CTTTAGTTAT 

G CTTT C AC C C CT C CAACAC CACAGTAAATTTTTT CTTTTTTT AAAAAAAT 

TATACTTTAAGTTATAGGGTATATGTGCATAATGTGCAGGTTTGTTACAT 

ATGTATACATGTGCCATGTTGGTGTGCTGCACTCATTAACTCGTCATTTA 

CATTAGGTATATCTTCTAATGCTATCCCTCGCCGCTCTCCCCACCCCATG 

ACAGGCCCTGGTGTGTGATGTTCCCCACCCTGTGTCCAAGTGTTCTCATT 

GTTCAGTTCCCACCTATGAGTGAGAACATGTGGTGTTTGGTTTTCTGTCC 

TTGTGATAGTTTGCTCAGAATGATGGTTTCCAGCTTCATCCACGTCCCTA 

CAAAGGATATGAACTCATCCTTTTTTATGGCTGCATAGTATTCCATGGTG 

TATGTGTGCCACATTTTCTTAATCCAGTCTATCATTGCTGGACATTTGGG 

TTGGTTCCAAGTCTTTGCTATTGTGAATAGTGCCACAGTGAACATTCATG 

TGCATGTGTCTTTATAGCAGCATGATTTATAATCCTTTGGGTATATACCC 

AGTAATGGGATGGCTGGGTCAAATGGTATTTCTAGTTCTAGATCCTTGAG 

GAATTGCCACACTGTCTACCACAATGGTTGAATTAGTTTATAGCCCCACC 

AACAGTGTAAAAGCATTCCTATTTCTCCACATCCTCTCCAGCACCTGTTG 

TTTCGTGACTTTTTAGTGATTGCCATTCTAACTGGCACCACAGTAAATTT 

TTATAGATTTTATAAGCAAATTGTATTTACTGTGCAAGAATTGGTTTATT 

TTTTAAACCATGTGTTGCAAACATACAATGGTTAATTGTGATATTTGCTC 

AGTACAAGATCATCAGATCACTACACAGACTTGAGGTAATTCCACCTAAA 
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AGCAAAGAGAACTGACCCCACATTAACTGAGAAGTCTTTACTTATTTA'i. i" 
CCCTATAAACGAGCCAATATGAAGAGAAGGCCTTAATGTGGTTAACTATG 
TAATTTTTTTCTGACTTTTTGAAATACTGAGAAGAGCTCATGACTCTCCC 
ATCTC~TAATTCTACCTTGGTGGATTTTAGACTGACCACAACTCATGGGT 
AAATGAGGGAAGACGAATAAGAAACCTTGCTTTTTTTTCCTCCTTGTTTT 
TGGCTGGCTGCAGTGGCTCACACCTGTAATCTCATCACTTTGGGAGGCCA 
AGGTGGGAAGATCACTTGAGCTCAGGATTTCAAAACTGGCCTGGGCAACA 
TAGTGAGACCCCATCTCTAAAAAAAAAAAAAAAAAAAAAAAAGGCGACGG 
GCGGTGCGTGCCTGTAATCCTACCTACTCAAAAAGCCGAGGTGGAAAGAT 
n r«rpr.s rzrz. TfTGG A.GGTCAAAGCTGCAGTGAACCTTGATTGCAC CACTT 



CATTCCAGCCTGGGTGACiUVAGUAGGAUOi- x uv-v_ j. w*ri«*«*«v-™---~---*w 
AAAACCTTAATTTTTTGGCTATTCTTTTCTGGTAAGAATGGTATAGAGAT 
GGGGATGAGGATGGCTATTGTATGAGAGAGCAAACAGGGTCCAAGCAGTG 
CTCTGGGCTGTCTAAGGACCAGTAGTCAGCTTAACTTCTCAAATTTCCAG 
GGAAGGAGTTCGGAGTGGTAGAATATCCTGGGTATGCCCAAAGCATCACC 
TTGCAAATAGCCTGTCATGAATAATTTGTTTCATTTGTTATGACTGGAAA 
CTGGCTTTGTGTATGCCAGAGAATGGGGGCAGGAAAGAGAGATTGGTGTC 
« ^ rn-nrrTfrrwrrrTRRfinr anTGATGCTTTTCCTCTCATGTGGAA 




GGAGAGCATGACTGAAAAGG lliLALAhA J. «-rt.«j<J ~ * 

AACCTTCCAGATACAGAGACACAACCTTCCCCAAGAGGTCCTCATTGCTC 
^GCCT^^TTTCCTTTTTTTTGCTTGTTCTACCATTAATAACAGAAACTGA 
TTATGA.CCTCAAAAGAGAGGAGAAAGCGACTCTCCCCACCCTAGAGCTAG 
TTAACCACCATATCTTCCTAGATCTCAGTTCAAGAGTCACTTCCATCCCC 




TCAGTTCCAGCATCTAGACAGTACCTUAAGLJW^wvAvaov-L.urtvavawwww* 
GCGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGATGAACATGGTGAAA 
CCCCGTCTCTACTAAAAATATAAAAATTAGCCAGGCGTGGTGGCAGGTGC 
CTGTAATTCCAGCTACTCAGGAGTCTGAGGTAGGAGAATCGCTTGAACCC 
GGGAGGTGGAGGTTGCAGTGACCTGAGATCCACTGCACTCCAGCTTGGGT 
GACAGAGCAAGACrTCGTCTCAAAAAAAAAAAAAAAAAAAGAAAGAGAAA 
AGAACATCAAATGAATGAATGAGTGAGATGAATGAGTTAGCAGTGTTGGA 
TTTAAGTGTCAGATTCTTCCCAGCTTGACTTTTTTCTTTGGCTTAGTGAT 



TTTAAGTGTCAGATTCTT5-Ct-ft.GU ^ i. luiutii j.««v- * — 
TTTGAGGTCNCAAGATTTATTTTCCTTTCACAAAGGTGATCACTACCATA 
AGATCTTCAGAAAAAGAATGTGGCAAGCCANGTCTCACTAATGCAAATCT 

CTATAACAACTGTATCAGTACT 
>Contig43 




AATCTAAGATCCCAGGGGGGGGAGAt;UA<-A*-<jv_i ivjvja j..«.wv»*w«.ww>_~ 
AAGTGGACCGTGTGGCCAGAAGAGTCCCGCACTGCACTCTAGTGACAGTG 
CAGAAAGTCACTGTGGGAAATCTAGAAGTTTCTACAGGTTGCTATTTCAT 
CATAGCACTGTGCAGGCCAACCCTTCCTGCTCCACTGGCTGTTGGGAAAA 
GCTTTCTCTTTTCTTCCTAGCCAGGGAGCTCTCAAAGTGTTCCACTCTCT 
„„„,„^„ o 1 -^*/-.r?^rs'rnr'a.rv2'rr:Tf:f3afi€5A.CACTTGCCGGCTGCTTGTC 



TGCTGACTCATCCCTTGGTTTCACTTGGAAAAv_u i^ual^auv. j. -l. 
CTTTCCAAGCATCZAGCCTCCTCATTTTCTTAATCCCTTAGGTGTGATCTC 
ACCTCCACACAGTAGATTGCCTCAAGGCCCAATTCCAATATGAATAAAAA 
TGATTATTTTGTCATCTTCCAATCTTCCTTTTAAAATATTATTTTATAAT 
„~~~„ ^™ ^_/?rt^i rrTi ar.TRi arsaCTATTTTTACCTAAGAAATGT 
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CTGTGGGAGGGGAAAG1 w JTTCAGATCAx CTACAGGACAGCCABrTGGiw^ 

■CTCAAATGGCTTACAGTTCCTTCATCCAGAGGGTCTTCATCTAGTACATA 

CCAGGTGCTAAGCCTGGGTGCTGGAGACATGACGGGGAACCCATTTACCA 

TGGCTTTGTTACTGTGAGATTCACATCTAGGGAAAGCCAGCAAAGGGGAG 

GG ATC G AGGAGAG CTTGTTAGGCAGAGAAAAT AC C CAAGGGCAAGGGAGA 

AGCCAGCCTGTTCTGAGCACACACAGTGGTTCCATCTAACTGGGCCTCAG 

TG£CAGGTTGGACTGGAGATGGGGCTGAGGAGCTGTCACAGAGCATTCTG 

GACACAGATGTCACATAGTCCCTTGAGGTTAGGGTCCTTAGGCATGGCAG 

CATTGCTTTGAGTTTTTCCTTTTGTAATGTTGCCATTCATGACAATGTGG 

AAGATGGGTCCTTGCAGAGAAGGGCAGGGCTGTGAGACCAGTTAGGAGAC 

TAAGATGTGAGCCAAGGAAAATGAGGAACACCTGAACACTGGGGCAGGTG 

CAGGGCCCAGAGAGAAGCAGATGGCTTCCTGAGGTTTTAAGTAGGTAGAA 

TCAAGGCAGCTGGTAAAGATCTTTTATTACATATAAACTGGAATAAGCCA 

TCTGCTCCAAGACAAAAGAGTAGGCGGAAAACAATACAAGACAGAAATGG 

AATTAGAACAAACCTGGGAGGAATGTGGAATTAGAGTAGAGAGTCCAACA 

CTGGCTGCAATCATAAAAATGTAAAACAAACAAAAATTTGCEAGGTGTGC 

TTACTTAGAAATAATTAGCTGTCATATTAAGTTCACTTGTGTTATGGCTT 

AAATGTGTCCCCCAAAATGTGATGTGTTGGAAACTTGATCCCCAATGCAA 

CAGAGTTGAGAGATGGGACCTTTAAAAGGTGATTAGGTCATAAGGGTTCT 

GCCCTCATAAATGAATTAATACTGTTATCATGAGAGTAGATTCCTGATAA 

AAGGATGATCTCTGCCTCCTCCCCACAGCCCTCTTGTGCATGCTTTCCTG 

CCTTTCCACCTTCTGCTATGGGATGACACAGCAAGAAGGCCCTCACCAGA 

TGCAGCTCCTTGATCTTGGACTTTCCAGCCTCCAGAACTGTAAGCCAAAC 

AAATTTCTGTTTATTATAAATTACCCAGTCTCAGGTATTCTGTTCTAGAA 

GCACAAAATGGACTAAGATCATTAGATTATCATTTTTTATCAGACTGTTG 

AAGTGAAAAATAAAAATCAAATAAAGAAATTAAGAGAGCTGCATGCAGCA 

GCTCATGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATTGC 

CTGAGCTCAGGAGTTTCAGACCAGCCTGGGCAACACGGTGAAACCCTGTT 

TCTACTAAAATACAAAAAACTAGGCCGGGCGCGGTGGCTCACGTCTGTAA 

TCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCATGAGGTCAGGAGATC 

GAGACCATCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAA 

AAAAAATTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGG 

GAGGCTGAGGCAGGAGAATGGCGTGAAACCCGGGAAGCGGAGCTTGCAGT 

GAGCCGAGATTGCGCCACTGCAGTCCGCAGTCCCGCCTGGGCGACAGAGC 

GAGACTCCGTCTCAAAAAAAAAAAAAAAAACTAGCCAGGCATGGTGGTGT 

GTGCCTATAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGA 

ACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATCATACCACTGCACTCCA 

ATCCAGCCTGGGTGACAAAGCAAGACTACATTTCAAAAAAAAAAAGAAAG 

AAAAAGAAAAAAAGAAAAGAAAAAGAAATTAAGAGAAGGGCAGGTATTAA 

CCCCAAATATCCCACCATAGGGACACATTAAAGTTTGCTTGGCCACTCCC 

CTAGCATAATATATGGAATGTCTTCAAGGACCCTCTGTTGTAAATACAAG 

GCCCTGCTGGACTTAATACAACCTGCAGGCTTTGAGATCCCTACTCTGTT 

GCCATCTCTCATAGGATTTGCAGACCAAATCCAAATACTTAAAATAGCAA 

CACTCACAAACATGCAAATCAGAGCAGAAAAGAAACTTCTAAAAGGCCCT 

GAAACTACACTTTATGAGAGAAGACAATAGGGAC CTGAGGGTGGTAGAAT 

TTTCTCTCTATGCATCTATGTTTCCAGGGCTCACTTTCTCAATAAACTCT 

TAAATTGCTTTTAAAGTAAGGGAACAAGCAAACATTACATTTAAGAGAAA 

TCAATTTCATAAAGAAGGGGGGATGTCCAGGGTACTTTGCTTCCATGTTT 

TGCTTCCATGAATTTGTGTTTAACAGAAGATGCAGAAAAACACACAATTA 

TTGCAAAATCAAGGAAATCCACTCTAAACATCCCTTGGTTTCCCAGGCCA 

GTGTCACAACTGAAAACACATATTGTGGCTAATTATGTGTCACAAATTAG 

AATGACAAGGCAAGAAAAAAAAAACTCTCTGATTAACTAATAGCAGCCAA 

CACAGACAGCCTGTGTAGCTCGACTCTGCTGGTTTATAAAAGGCAGAAGA 

AGCAAACGGCTTCTGTGACCGCAACAGGAAGGGCCTCTGCTCTTAATAAA 

TAAATAACATTTAAATTATTCTCCCCCZATTTGCAAAGCATTTTCCAACTC 

ATTATCTCATCTGACCAGGTATTATTGTATCTGACCAAGAACTTGTATAC 

NAAAT AAAG AAT AAAAAAT AAAT ATGGG C CANG CACAGT GG CT C ATG CTT 

GTAATCCCANCACTTTGGGAGGCCCAGGCGGGTGGATCACTTGAGGTCAG 

TAGTTTGAGACCAGTCTGGCCGACATGGCGAAACCCCGTCTCTACTAAAA 

ATACAAAAATTAGCCCGGCATGGTGGCACATGCCTGTAATCCCAACTACT 
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TGGGAGGCTGAGGCACGaGAATTGCTTGAACTCGAGAGGCG^GGTTGCA. 

GTGAGCCGAGACTGCGGCCATTGCCCTCCAGCCTGGGCGATGAGAGCGAA 

ACTTCATCGAAAAAACAAAAACAAAACAAAACAAAAAACACCTTAGAAGA 

AGCGTTCCTCCTCTTGCTTTCTGAAGACACTCTACGCTGAAACAGTAACT 

TTCAATAAACCATCTCTTCTCACCGCACTCTGCGACTTGCCTTGAATTCC 

TTTGTGTGCAAGATCCAATAAGCCTCTCTTGCGGTCTGGATGAGAACCCT 

TT-TTTTGGAATACTCTGACACAACAAATTGCAGAAAGAAAGTCTCACATG 

TATAAAATAAGCAAAAAGATTCTCTGGCATCTGAAGAAACAATTTCCTTG 

TCAATATTAGTATCACTATAAGTGTAGAACAACCTGTTGTATGATGCTAC 

ATAAAGTATATGAATCTGAATACTGTTGGATACAAAGGGAGACTATNNAA 

TGTAATACGTCGCCCGAAATGACTACACTGTTGGTGATCTTTCTTTCAAG 

AAGCANAATATTGCCTCNAACATCCTGTACATGGTATAAAATTTTA 

>Contig44 

CCCAGCAAGAACACCAATACAACGGGGGGGGCGTTCTTTGTGAGGGGTGG 

GGAGGTCAATTTTTTGGAACCTGCAGCAGGTAACACACAAAACTTCCACA 

GCTGCTACCAGCTTTCCAGGAGAGCCTGTGTACCTGGAGAGGAGAAGGCA 

AGTGCTTCCGAACTTGACTTGATGTCTTAGATTCTGCAATGCGTAGTCTG 

TAGGGACAGGCTGTAGCTTATCCTATAGGCTTGGGCTGGAGTCAGCAAGC 

ATCTGGGCTGGCAGAAGATAAAAGATGCAAAGGTGGAGGAAAGCATACGT 

GGTCTGGAAGACAGACTTGGTGGGTGGGTGGCTGCTACAACACCCTAGTT 

AGAGGTAGAGGGGTAAGTCAGTGTGTCTTCTGCACAGGCCTCTTCCCCAC 

CTCATTCTTCATTTCCCATACAGCCTTGCTGAGTTATTCACAAACATCTG 

ATTCAACTGGAAGCTGGGTTGAGGATGACCTAAAGGACTAGTGTGATGCC 

TGCCCAGGGGTGTGGGCCCATAGTCAGAGTCCAGAGCCTCCTCTCAGCTT 

TTAGCACATCTCACCCACATCCTGGGTCCTTAATTAGCAATATGAAAGCA 

AGCCAAGTGACAAGATTTTGTCCCTGGGAAGTCCAGAAGCACTCCTTTTC 

TCATTTGTATAAGCATAATGATTTGCTTACATAAATAATCATGAAAATTC 

AAATCCCTCTCAGAAATCAGGTCATAAAACCATGAAATGCAGCATGTGGG 

CAAGAATCACAGGGAAAGGTAGGTCTTGGAAAAGAAAGGATGGCAGGGAG 

GAAGAAAGCAGGGTGCCAGGGGC CCTGGGCTGCTGT C GAAGT CAGGTGGC 

TCACCGTCTCTGAGAACATTTCACTTTCTGGTAAATGGGGCAGTTGGAGA 

TAGAAGGGTTGGGTGAATGCCAAGAGTGAGCACAGCTGAGGTCAGTGCTG 

TGCCTGCAGTCCAGGCGGGAGTAGAAATCCTGGGCCCATCTTACCTCCGA 

CCTCATTTCCTCCTCTGTAATAATGTGGGGGTGGGGGAAAGTTCTGGTCA 

TCAGCCCTAGCATTCCATGGTTCATTTCCTCATCAGTGATGGAAAATCAC 

CAAGCAAGAGAACAGGATGGAGAATAACCGGATGGGTGCAATCGGAGGTG 

CTATTTCAGGTGAGGTGGCCAGGGAAGGCCCTCTGAAAGGGTGGCTTGAG 

CAGGTGGCTGAATGTACAGAAGCTGCCAATCATGAAAGATCTGGGGTACA 

GCATGCCAAGCAGAGGAAATGCGAGTGCAAAGGCCCCGAGATTGGATGTG 

GGCTTAGCACAAATGTGGCATGGCAAGAAGGCCAGTGTGGCTGAAGCAGC 

ATGAACAATGGGTGGAGGGGCTGAGAGGACAGAGGAGCAGGAAAGAGCCA 

GGCTTGGGTAGGAGAGGTGTCAACTTGATATATGATGCAAAGCCCTTGGA 

GGTTCCGAAACACAAAAGCAATGATCTAATATATGGTTTTAAAAATGCCA 

CTCTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAG 

GCCGAGGCGGGTGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAA 

CAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGCG 

GTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAAT 

GGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTG 

CAGTCCGCAGTCCGGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAA 

AAAAAAAAAAAAAAATGCCACTCTTGCTGTGAAAAATTGACCCTGGGGGA 

AGGAGGAGTAGAAATGTCAAAAGTGGAAGCAGACCACTCAGGAGGTCAGG 

GCAATGGACTGTGCAGGAGAGACTGACATCTTAGACTCGGGCAATAGGAG 

AGAAGGTGGTGAGGATTATATTCTGGGCATAAAGGCAACAGAACTAGCTG 

ATGGCGTCAACGTAGGAGATGAGGGAAAGAAAGAAATCAAAGGGCATTCA 

TAGGTTTGAGGGTTGAGTAACTGGGGATATTTAACAGAAATGGAGAAGTC 

TGGGGAAGGGGCAAGTATTGTGGGGGCAGGGGTCAAAAGTTCTGTATTTT 

GGCCAAGTTAATTAATATTTGAGATACCTCTTAGGTGTCCAAGTGAAGAT 

GTCAAACAGTCAATTGAATACAAAATCTGAATCTTAGCCCAGGATGGTCT 

CACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGAGGATCACTTG 

AGGCCAGGAGTTTGTGATCAGCCTGGGCAATAGAGCAAGACCCTGTCTCC 
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ACACACACACACACACA . AAAAAGT CA'x wCAGGCATGGTGGCAGIATGCw + . i 
GTAGTCCCAGCTACTCAGGAAGCTGAGGCAGGAGGATCACTTGAGCCCAT 
GGTTCAAGGCTGCAGTGAGCTATAATCACATCACTCAATACTACACTCCA 
GCCTGGATGACAGAGAGAGACCTCATTTATTAAAATAAAATTTAAAAAAA 
TTAATTAAAAATAAATCCAAATCTTTCCTGAGATTCATATTCAGGAGTAA 
CTGTCATGTAGAAGGCATATAATGCCATGGGTCACATGATACCATCTAAT 
' GAATGCCACTGGAAAAGAGAGAATAGCTAAAAACTGAGCACTGGGCACAC 
CAbCACAGTGAGGTTGGAAGGAAGAAATGGAGCTAACAAAGGAGACAAAA 
GAGGAGTAGCCAGTGAGAAGAGAGAAACATCTGGAGAGAAGAGAGAGCAG 
CAAAAGGTGGGTGAAGGAGAATGTGGTCCACCAGGCCCAACAATGCTGAG 
CAGTTGAGTAAGTGAGGACCTGGCCACTGAATTTGGCAAGAAAGAGGATG 
TCAGCGGCCCTAGAACAAAAGTGAAGAAGAGCTTGAGGACGGAAGCCTGA 
CAGGAGTGAACTGAGGAGAGAATGAAAGGTGGAGACATGGAGCCAAGGAG 
CACTGAGACTCCCTTGAGTAGTTTTGCTGTAAAATAAAAGTGAGTGCAGA 
GACGGGGCAGGGGGACAGAGAAATGCAGGGGTAGCTGGAGGGAGCCACAG 
AATCAAAAGAGGGTTTTTGTGTTTAAGATGGTAGTTGTCACATAGCACAT 
TAGTAAGTTCATGTGAATCACAACGTAGGTGAGACAGATCACTAATGCAG 
GAGTCAAATCCTTGCAGAGCCCCCAGAGGAGGTGATGAAGGGAAGTGATG 
GACATCATTCAGATGCAAGTAGGTTAGCAATTCCTGGGGTACAAATAGGA 
GGTGACTCCTTTCTGATTGCTCCTGTTTTCTGAATGAGATAGCACATAAA 
GTCCACTCAGCCATGTTAGCTGTTGAAGTCCTTGTGGCTGTCATGCCTGT 
ACAGACTGGGCTCTCCTCTCCAGCATTTCCTCTCAGACTAAGCTGAGCTG 
CACTAGCCGCTGCCACATCCTCTTGGGGCCATCCTCTGCCACACTCCACA 
TATTGCTGTGGTTTGCTTGCAACCCCTGGAAGGTCCTACTGGCTGCTCCT 
AGAAGAGTCTGGGCGGCATCTCTCCCTTACTCGTTATCACATGGTGCTGT 
AAGCAGTGGCCACACACTTTAGCTGGTGGGATGGGCCATCACAGGCAGTA 
AATGCGAAAGACTGCTCAGATTTTAAAGCACCCATGAATCAGTAGAATGA 
GTTTAGAATTGTAGTCATCAACACACATTAAAAAAAAAAAAAACAGGCAC 
TAAAAAAATTAGTTGAGTAGGATAAAGCCATAAAAGATATTAACTACAAC 
CCAGATAGGAGGTGCAAAATTGTCCTTACATAAATCAGATGGAAAAAGTT 
GAAAGCAGATAAGATAAAATAGGTAAGCATGACATTTAAAAGGTATTCAT 
GGGACGTGGTTACAAAACCAACTCACAACTAAAAAGTCTTAGGACCTCTC 
GCTGACTTAGGAGCCTGATCCCAACTTTGAGAATGACTCAGTGTGTTACC 
CTGTGGCTAGTGTAGACCAATGATCCTGTCTCAGAGTCACTAGCCAACAG 
CCCATATCAAGTAATTGAAACTTTGACTCAGAAACCTCAGTGTCAGAACC 
TTTGACTTAGGAACCACCTGTAGTGGTTAACTGCAATTTGCACCCCTTAG 
TTCAGGGCTTTACAACACCGGGGGCGGGGAGGGGGAAGGCATAGAGCTGA 
TGACCTAAAGGAAACCCATTGCAGCAACGCTTTTGTGTTAAGTTTACAAA 
TAAGTGTTGTTTTAGAATCCTCCAGGTAATGCCTTTGTTATTTAATGTGT 
CTGAGACAATTCTGCACATTAAAGAATATAAAATATTAC CTTGTAATTCC 
AATTTGAAATGTGTAATTGACATTAGACTTCTATTTTAATTTGAAATGTC 
TAAAACAATGTGGTTAAGTTTGTAAAAGGTGTGTGAATTTTGAGTCTGAT 
TTACTACATTTTTTTTTAATTTTCTTTTTTTTTGGAGTTTTAGGGATTGC 
TTAGATGGCTAGAAAGATCGCTAGGCACATGTCC 
>Concig45 

GATGTGTGTACGTGTGTGCAAATACCGTGCCTTTTTTGTTTTCTTTTGTT 
GAAACAGAGTCTCACTCTGTCGCCCAGGCTAGAATGTAGTGGCGTGATGT 
CAGCTCACTGCAACCTCCGCCTCCCAGGTTCCAGTGATTCTCCCGCCTCA 
GCCTCCCAAGTAACTGGGATTACAGGCGCCCACCAACACGCCCAGCTAAT 
TTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 
TCTAACTCCTGACCTCGAGATCCACCCACCTCGACCTCCCAAAGTGCTGG 
GATTACAGGCATGAGCCACCATGCCTGGCCAATACTGTGCCATTTTATTA 
TCAGGGACTTGAGCATCCATGGATTTTGGCATCCATAGGGGTCCTGTAAC 
CAATACTGCACAAATACCAAGGGACAACTGTATTCTAAAAAGACCAAAAA 
TTAATAAGCAGGACGCTGAAGGTAATTGCCCCAATAAAGTCATGATCCCT 
TGCCCAGTGTCTGAACCTCAGCCAGTTTTCATACTCAGGACCTATTGGCT 
GCAGAGGTGGTAGGAACCATATGAGAATCCTGCAATATCATGGCAAGTAT 
GCACTTTAATGATATCTGCAGTCCTTCCCCAAAAGGACCTTACATTTACC 
ATACTGCTATGTCCTGCGTGAGAGGGTAATACTCAGATTTTTTTTTTTTT 
TTTTTTTACACAACGTCTTACTGTGTTGCCCACACTGGAGTGCACTGGCT 
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CGATCTTAGCTCACTGC XTTCTGTT1 - JTGGGCTCAAGTGATTGTC' . 

GCCTCAGTTTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCTG 

GCTAATTTTTGTATTTTTAGTAGAGACGGAGTTTTGCCATGTTGGCCAGG 

CTGGTCTTGAACTCCTGACCTCATGTGATCCGCTGGCCTCCCAAAGTGCT 

GAGATTCCAGCGTGCGCGGCCATACCCGGCCGGGAATTCTTTATATATTC 

TGAAAACTAATCCTTTGTGAGACATAAGTGTTGTAAATATTGTATCCCAG 

TTTGTGGCATGTATTTTTAATTTTTAATGGTGTCTCTCAATGAAAAAAGC 

TTAACACTTAAATGAGGTCAAATTGATCACCTTTTTATTTATGGTTGATT 

CCTTTGGTGTCATGTGTAAGGAATGTTGTTCCTTCCTGTCCCAAAGTTGC 

AAAGATTTCTTGTGTATTTTGTCCTAAAAGTTTTAAAGTTTTGCTTTTCC 

CATCTGTGCACATTTCACATTTGCTACATCTCACTGACTGCTTCCTCTGC 

TGCAGAGCAAGCTCCATGAGAGCAGGAGGCATGGGTCCTGCTTCTTGTTG 

GTCCCCAGAGCCCTATGTCATGACTAGGACCTGGCAGGGGACTAGTGAGT 

AGCTCCTGACTAACTGACTCAATGAATGAATGATTGGATGATTGAACAAA 

GTGGTATGGGAGTTCACAGCGAGTAAGAGATGCCTTAGAAGAGATGAAGA 

AGGAGATGGTATAGGGTAGTGGTTCTCAATTCTGGGTCCATGGTGGACTC 

ACCTGGGGACCCTTAAAATGTACCGTGGAGGATCCCAGCCCAAGAGATTC 

TGTATGACTGGTCTAAGATGTGGTCTGGGCACCAGGTGATCCCAGTGTGC 

AGCCAGGCCTGAGGCCACTGGATTTGGTGGTAAATGAGGTAACTATCAAG 

GGTACAGACGTTGGTTGCCAACAGGCTTGGGCTTGAATTTAAGCTTTGTC 

ACTGACTTGCTGTGTCCTCCTGCACTCGTTGAGCCTGTTTTCTCAGCTGA 

GAGATGGGTGTGATAACACCTACCTGCTGTAGTTGTTGTGAGAGTTAGAG 

GAGATAAGCATGTTCCTGGAATGAAGTGTGTTCTTAATCCATCATAGGTT 

TTTTGCTTGTTTGTTTGTTTGTTTGTTTGTTTTTTCCTTTTCAAGAATGA 

GGTTGAGCCAGACTTTGACAGCTGGGTGGGAAGTGAACATGTGGTGATTG 

GGAGAGAAGGGCAGTTTATGTGAAGGGAATGTAATAATTAGAGAGTGGGC 

GTGGGAAGACATGCTGGGGAGAGTGAGCAGGCCGG7TAGCCCTGGTAGAG 

GGTGCAAGAGAGCAGTGCGGAATCTGCCAGGGAGACAGGTGGGTGACCAG 

GGTGCCAAGGGTGTGGCTTTTCCCAGGTTCCCATGGACACAGCCATCCTC 

CCAGATGCCCAGCCTAGCTGTGAGTGAGCAAGAGTTCTGGATTGTCTCTC 

TCACTCTGTCTTTTTCTCTCATTCCAGAAACAAAGCAGTGACTGGTACTT 

AGGAGGAGAATCAGGTCAAGTTGGGAGAAACTTGCTTCTGCTCAGGGGAG 

CAGAAGCAAGAATGGAGGCCCCACCCATGCTGGAAGATGATGAGGGTTTT 

GGTTCAGGGAGGAGGAATATTGGGGATCTAAAGGGGCCTGGGAGTGGGGC 

AGGAC CCTGC CTTAGGACAGGTAGAAACATTTTCTATAAAAAATGGGGTG 

GAGGTTGATGGTAGGACCAGGCATCTTTAGTTGGCTCCCTGGAGTGTCAA 

GCCCTTGAGATGGTCTTTAAAAGCCATGCAGTGGGGTTTGAATCTGGTGT 

TCAAGCTCATAGGTTATTAACATAATGACACTTGGAAACTATTTGGGAGA 

3CTCAAGTGAGTGGCCTGGAAGTTCTGTGTTGGTGCAGGAGGTGACTTAG 

GATGTGCTGCTCCAGACTCATATCTTTGACTGCACACCTGATGCTTCATC 

TGGCTATCCTGTAAGCACCTTCAACTTAACATGTCCTACACAGAACTCTT 

GATATTCCTGTTCCTCCCCCAGTTCCTCAGTTCTTACCAAATGTTCTTCC 

AGTTACCCAATTGCTCAAGTAAAAAATCTAAGTCCTTCTCTTGGATTTCT 

GCCTGTTCCCTCAACATCCCACCTATCCATGAGTGTTCTGTGGGCCCTGC 

CTCTGAAATAAATCCTGCCTTTGTCTCCCAGTTCACTCCAGCCACCCATC 

CTGGGGCTGCACCCTCCTCCTTCCAAGCCCTCTCCCTTTCCTTCCTGGTG 

CTGCCTGTCATGTCAAGCATATGCATCAGTGCGACCAGGACATTTGAAAT 

GCAACCAGTACAATTGGGCGCGGTTATGCCTACCAGTTTTTCTTCCTTAA 

ACATTTTATATTTATGTTTGAAAGCATGCCACCTTTCTTCACTTGCCAAC 

TTGACAGATTTATTAGTTGACAACATCCGCTGATAGCATCAGTAATAAGT 

TAATTGTTTTTGCACATGTAGCTTTAATTATTCTCATTATCATTTATAGG 

AGTTATTCTTTGTAAAGGGTAACTGAGTTTTCCAAAACAAACAGAAATTT 

GGGGTGGGCCCATGGAGCGTGACTCATGAAATCAGATTCTTAGAAGGACC 

TCGGCAAGTCTCTGGGTTGCTGTTAATGAGCCTGGCTGGCTGCCAGGGGT 

GTGTCTGCCCTTTATGAGGCCACCACTGTTCAAATGCTTGCCTGCAGCAT 

TACTTGCCTAGGTAGTGCTTGTTTCTACTGAACTGTCAGGGATCCAATTC 

TTTGTGGTCTAAGTAACAATACTCAGATTCACAAGGAATTGATTAATAAG 

CCAGAATGCCAATGTATTACATTTTTGATGAAGACCATATTTACAGTGAT 

TGTATCTGCTCAAGCTCAAATTAGGATTAGAGTTCTGACAAATACATATG 

TGAGAAGTATGAGGTTAAATACTTGAAATTTGGACTTTTCTAGAAAATCT 
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GAATGTGATTGCCATTCACATACCTTTCTGGGGATGATGATTCTTGTACT 
TTTATTTTAAAAGACATAGAAAACTAACTTAAGAATCAGATTGCTTGGCT 
GGGCACAGTGGCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCAAGGTG 
AGTGGATTGCTTGAGCTCAGGAGTTTGAGATCAGCCTGGGCAACATGGTG 
AAAT C C CAT CT CT AC CAAAAAT ACAAAAAAAAAAAAAAAAACAAC CAAAA 
AGAATAAATTAGCTAGGTGTGATGGTGCGTGCTTGTAGTTCCAGCTACTT 
GGGAGGATGAGGTGGAAGAATTGCTTGAGCCCAGGAGGTGGAGGTTTCAG 
TGAGCTGGGGTTGCAACAGTGTACTCCAGCCTGGGCGATAGAGTGAGACT 
CCGTCTCAAAAAAAAAAAAATCAGATTGCTTTATTGCTGGTTTTCTTTCT 
AAAACTGAGATTGGGTCCCATCATCCCCTGGCCCCCATTGGTTAATGGTT 
CCTCCTTTGTCTATTGAATAAAATACAGATGTCTGCTTTTGGCAACATGG 
TTGAATGTAGACACTGCAGGGTCTTCCTGACTCAAAATGATTTAGGCTTA 
GATAAAACACATTTGGAAATGCATTTCTGGATTAACACCAAGGAAAGGAG 
ATCTCTTTAAATCCCCTTTCTGTTCCCCCCTCCCTACCCCCTCCAATTGG 
GCTTAAGTAAGAAGGGTGGTTACCCGCTAGTAAACCCCCTTCGAAGGGGG 
TCTTCTCCTCTAAGGGAAAACCCTTGTTTTGACATTTGCTTCAATGGGCC 
CTTGTATTTTGTTCCTTGCTAAACGGGTGCTAAACCAGGGGCCTCCTCTT 

>Concig46 

AAGGCTTTTAGAATATTTGCACACTTTAGAAATGGAAATGTTTTTGGGGG 

GCGAGTTGTCTTAATATTTCATTTTTCTAGCTTGTGTGACATCCTTTTGA 

AAGCAGCAATTCTGGCCTTTGTGAGAGATGGTGAATGCCTGCAGGTGTGT 

GGACCAGTGCGTCCCTTCCTTCCTACATGCACGGCCCCCAGCTGGGCCCA 

GCAGAGTGCTGTTACAGAATAATTTCCAAGGGCTGTGTCTCTAACCTTTG 

GTCTTGTCCCCCATTGCTGTAGATTTGGCCAATTGACTTCATAAGTGCCT 

CTTATGAACATAGATGTTGGCAATGGAAGTTGAGGACCAGTCAGTGGTTG 

TTTTATTGAACACACAGCGTAAATCCCAACACAATGCTGACCTAAGAGAA 

TTCCAGCCACTCTGATTCTCAGTCTCTTTATATCTGAAAGGGTTCTGTTC 

CACTTTTTCCCAGATCAAAATGTCCCTGCAGCTACTCAGCAGAGCTGTCG 

CAACTTATACGTAGAAGAGGTAACAGTCCACAAACAGAAAGGCACAGGAC 

GAGAGTGGTCTGGGTGATGCTTCCTGTGGGGGAAAAGGTGATGAGGGTGC 

ATCTGCACACCTATGTTCATAGGTAAGTCTGGGAGGAGGTGACCTCCCCT 

TTGGTTGAGGTGCTGAGGCGTCTTGTTAGAATGGCACTATTCCATTTATC 

TGATGCAGTCTGTGGGAATTTTGTGGTATGGCCACCACAGGTACCATGCT 

GGGAACAATGCCAGATACTGCCTGCTAAGCCACAGCATGAGTCACATGAG 

CATTTGTGGGCTTTGGGAACTAAAGTTATTGAACGATAGTTATCTGAAAA 

GGAATTTAGGGAAAGGGGACTTTAGTCCAGCGAACAGTTTGCAAACCAGG 

GGGAAGGCAGCCTTCAGCGTAAAATGAAGACGTGTGTGCCCCAAATAACA 

AAGGGAGAGTTTGTCTTTTAGAGAGTAAATGTCCACGCAAGGTTCCACTT 

AGGCAAATGAAAGATGGAAACTTGCTTAGTTCTGATTTGTTTACATTTGC 

TGAATTCGGATTGGTCCGTGCAGGCTTTTCTGGGAACTCCAAATACATGT 

ATGACCTCTAGTCATACATGGCAAATGGCCGCTTGGCTCTAATTTGAATT 

TAGGCCCAGTTAGTCACTCAGGATTAACCTTTTTCAGGGTTCACAGCTCT 

GAACAATGGACTTAGACCTGCAGGACATAATCTGTTCCTAACTCTGGGAC 

TACCTGTGCCTTTTGACTGTGCCCAGTGAGGAGCTGTGGCTCTGGGCCCA 

GACCCACAGGGCGATAAGGCACAGAGGTACGCATGGAGCAGGCTGTCCTT 

GCTGAGTGATCATGAAGATACACTTACATAGAGCAGCACTTTTCCTTCCA 

GTCTTTGTGATTTAACTCATTAGATCCTTATAACAAGAGTCAGTCCTCTA 

TTTAACCCATGAAGCACAGGTGGAGTCCAAGCTTAGTTTGTGAAGGATGA 

GCCAAAAGGATTCTTCTCTTGTAGACCTCAAGCTCAGCTCTCTCCATGGG 

CCCTGGAGTAGGTGAGAAGGCCTCTGTCTTCCAGAGCCCACTGCCAATCA 

TCTACATTTTCTGTTAGCCCAATTCTAGGACATTGCTTTACCAACTGAAG 

GGTGAGAACTATCATAAGTTATAAAAATCAATTGAAAAACAAAAAGGTAC 

AGAACAGAAAATAAAAGATGAGAATCTATTAAACATAGTGATGTTACTGG 

AAAAGGGGGTCTCAAACCAGACCCCAAGAGAGAGTCCTTGGATTTCACAC 

AGGAAAGAACTCAAGGTGAGTTGCAGGGTGCGGTGAATTGAGAGAGTTTA 

TTGAAAGCTATTCCATTACAAAGTAGAGCATCCTCAGACAGCAAGTGGAG 

GAACATGCCATCATTAAATTTTTCTTATATAGGAATCTTGTCTATATAAA 

GACTAAACTAAGCTGTGGCTATGTGTGGGTGGGCCGACAGCATGAAAACA 

TTTATTCTCCTATTGATTTAAAGAGAACTATCCTTGACATTTTAGTGTGT 
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TTAAGTACATCAAAGCAl^CTATAATTATCTTGAAAGCATATATTTTT^ 

TAGGGATTGGGACATCTGGGCTTTCTGTTGTTGTAGAAGTTTGTCCTTGC 

AGGGATTACCAAGCCACTTCCTTAGCTGTAAACATCTTAGGGCCATGGGT 

CCTGACTGGCAAGGAATGTGTCTTGCTAGTTTTAAGATGGGCTTGATTTG 

AAAATGGTGTCCATCTGGCTCTCCTAGGCTCCTGCTTTCCTAACAGTAAG 

GGTAAATGCTATGTTATGAAATGTCATTTCTGCCTTTAGCTTGCAAACTC 

TTGATGGTGAAATTCTCCTGTCCGTTTTCAGTGGGGTATTTATTCTGCAT 

CCACGTCTTCACAAGGAGCTGAAAACAAATTGGATGGAAGCAACTGGGTT 

TTATGGGACACGTTAATGTTTTAATGTCATTTGGTGTGGAATTCAGATGT 

CCAAGCAACATTTTACACTACAAATCTGCAACTTTAATAATCACTCAAAG 

TACCTGAACCTCAATGCTTTCAGACAGACTTGGTATAAAGCCACCACCTC 

TTTCTATTATGGCAGCCCTATCCTGAGGACACAAATTTCTGCAGGGCTTC 

TGGCATATCTCTGATTAAACAAATGTCAACAAGGTTAAAACAAATGTCAT 

CTCTGATTTGTTTGTTTTAAAGCCTGGATTTACTCATTGAATATTTCACT 

CCTACTAGCATGTCTTGTAGTAGTTTTCTTCAGGGACCCTAATTATTGCT 

ATTAAAAATATGTGTGCAGCTACATGTTTTTTTTTTATCAATTTGCAATG 

AAAACTTTAATTGAATAATCTATTAGTGTTATTATTTGAAAGTGAAATCT 

TTTCCTTTTGCTTTCTTGTTCTCACACATAGTGCAGACAGTTTCCACACG 

GGCTCATAAAAGGAATGATTCTGCCTTGTGTGAACTTTTTGCCTTTATTG 

TTAATTGCACCATTTTGTGACTGGCTTCTTGACCCTGTTGTAACCAAGCT 

CATAATGTACATTATTTCTTATTTTGCAGTTGTAGACACTTGAGGXAGTT 

CCCATTCTTTGTTTCTTCTTGCTTTTGTTCCCTGTGATAACTTTTTCATG 

CAGACATTTTTTTTTTTTTTTTTTTTGAGACCGAGTCTTGCTCTGTCATC 

CAGGCTGGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCTGCCTCC 

CAGGTTCAAGAGATTCTCCTGCTTCAGCCTTTCTAGTAGCTAGGATTGCA 

GGCGTGCACTACCACACCCAGCTAAATTTTTCAAATTAGCCACCCCACCT 

GGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCAACCATGTTGGCCA 

GGCTGGTCTCGACCAGGTGATCCACCCGCCTTAGCCTCGCATAGTTGCAG 

GTGCTATTCTGAGCTCAGGGCTCTGGCAGCTACAAGCCCAAGATGCGGTC 

TCCAACATGTGGCCATTCAATGTCATGGCGCCCTCTACTGGTCCTGGGAA 

GCGCAGCTCTGCCAGTAGCTCCAGCAGGGCACAGCTGTTAAGTCGTGATG 

TTCTACAGGTGACCAAAGGGCAATCTCTGGACTCCTTAGCCGCTAGGTCC 

TCTCTGTAGCAGGACCCAGGAGAAGGCAGGGGCTGAGGATGGCTCTCTTA 

GACATTTGTGATGAACCAAACGTGTGCATTCATGAAACTTCTGTGAGCAA 

GCAGGTGAGTAGAGTTGGGTTATAAAAAGTCTTAGGGTCTCACTACAGAG 

ATGGACTTGCTGTGTAGATGGTGCAGAGCCGCTGAAGAGTTCTACTTGGG 

GTAATGGTGTGATTGGGTTTGCGTTTTAGGAAGATTTCTTGGCCAGAATG 

AGGCGGGCAACCCAGAGCAGGGAGTGGCCACATGTGGGTGTGCAGTTATG 

GGCCACTAATCCAGGTGATAAATGGTGTCTCTGAACTTCAGGTGGGGGTG 

CCACATGTCTCCATCTGCTCTGTACCCTTGAGACTGGCCTTATGGGCTGC 

CTTAGTGGTCTGTTGTCCTCTATCTCCTGGTTGGGCTCAGGCAATGGGAG 

ATCAGAGGGAGGAAAGAGAGCTTGGTTAGAGTGCACCCGCGCCCCTTCAG 

GTTGGCAGTGGCCACATTCCCCTATACAGAAGGCCACAGTTTCTGTCAGT 

GGCCCTCCCACAGCCCCAGCTTTCTCAGTGGGCCAGCCACCTCCCCATCC 

CTTGCTCCTCCTCCTCCAGAGAGGGTTGTGGATTTCCACTGTCAGCAGTG 

CCTGGAGCTCCACCATCTCCTGCTGCTTCCTCTGGACCTGCCTGCAGTTT 

TATAAATAACCTTTCCTTACATTACCTCTAGCATGCACCTTTTGTGTGTA 

TACTCTGCCCCCTGTCAGCACATGACTCATGCCAAAGAGTTTGAATTTTT 

TTCTCCAGGCAACGGGAGGTCATTGGAGGATTTTAGACATTGAGAACAGA 

TGTGTATTGTGGAAATATCTGTCTGACTGAAGTGACCAGGATGGTCCAAA 

AGAGCGAGAATTTGAGGCAAGCAAACCATCAGCAGGCCAGCAGCAGAAAT 

CCAGGTCATAAACAGGGAAGCTGAGGCTCACAGGGTTGGATCAGGGAATG 

GGAGAGGGAAGCCAAACAATTCCATGAGCATGTCAGTTGCACATATGACT 

TGGTAACTATTTTTATTTTTATTTTTATGTTTTGAGACAGAGTCTCGCTC 

TGTCACACAGGCCAGAGTGTAGTGGCATGATCACAGCTCTCTGCAACCTC 

TGCCTCCTAGGTTCAAACAATTCTCCTGCCTCAACCTTCCAGGTAGCTGG 

GACTACAGGTGCGCACCACTACACCCAACTAAGTTGTGTATTTTTAGTAG 

AGATGAGCATTCACGCTGTTGCCTTAGACACGG 
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AATATTGATTATTTGACCAGAAATTCATGCAGCTAACCGTGACCCCTGGC 
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JLaAAT AAAAT AGTGTAT* . ^ GTACGTGCATATACATGCAAAGAAATGAGx . 
GAAACTAGAAGGATGTCAATCAAATGATAACATGGTCATCTTGGGGTCGG 
AGTACATTTGGGGATGAGGGGAGCTGTAAAAGCAGACTTGGACCTTTTCT 
7 CTAC CAGTAC CGTGTCATTTGAATTTTGGAAAGAAAAAAAAAAACT CAG 
AAGGAGGAGAAGGAGCAGGAGGAGAAGAAGATGGATCTTAAGTGATTTGC 
CCGGGAGCACCTTGAGAAGGTGAGATTCAAGTCTAGGTCTAAGCTTTCTA 
ATTCCATGAGTGGGAGTGACCCACGTCCAAGAGGAAGCTCAAAAGGAAGA 
TGTTCTCCATCATCTCTTGCTCATCCTAACAGCATGCAAAACCACATCCA 
ATGCAGCTCAGAAAACTCCCAAATTGCCAAATTTCATTGGAAACACTTAA 
TGCTGTGGTTTCCAATTTCAACTGTAAAGTAGGTATGTATGCCATTGTTA 
CCATTAACTTCTCAGAAATGGAGAGAGCTCTCTTTCCGCCTCCTCCCCCT 
CTGCTGTGGCTTTGGTGAGACGTGCACTCAGGCTCACCTGTCTCCATGAT 
CTCCAGTAAGTACACATGAGCAGAGAGGCCTCAGCTCAGCTCTTCCTGGT 
CCCACCAGGGTTGATTCTTTGAGAATTCTAGAATGCCACATCCTAGGCCC 
CCCAAAGAAATCCTGCATCTTACCCCCAGAAATATGAATCATAGCAAATT 
TCAAATCAACCATCGTTTAATACTCACAGACTGGGCACATCC^AAAACAT 
ATTTTCAGTTTTACAACAGTGCCTGGTGCATATCGGCACTATTTGTGGAA 
GCAATAAATCGACACGGAGCTGAAACACAAACAAATGCCAAATTGTTTTT 
ATAACACCTGATTTTCTTTCTGTTTCTTTATGCAGTTTAGTTTTGTTTTG 
CTTAACTCTACCTCAGACCATAGTCTGGTAAACTCACCACCCAGAAGCTC 
CCTTGAAATGTGGGTATGCAGCCACTAGGTGGCAGGAGAGAGTTTCCTGC 
CTGGAGGGAGGACAGCCACTCTGTCCCCGGGTCAGGCCAGGGCCACCCTG 
CTACCTGCAAAATTAGCATGGGGCTTTATGAACCACAGCTTCCTAATAAA 
CACAGGATCTGTTTGATAGAGACTCCAAAACACGCCTACCTAGTGATGAA 
AGACTCAACTTCAGAAGAAAACCTTCATGGCAAACATCTTCAGAGATGTT 
TCCAACTTAAGGTTCTGAACACAGACGCTTCCCCAGAAAGCCATTGTTTC 
TCAGCACCTGGGAGCCTTGCTTTGCTTTGCTTACAGACTCGCTGTTCTTA 
AATCACTGCCAAGATAACATCTGTCTCTTCTCTTACCCTCTATTTCGATA 
TAAGGACTCCTCACTCTTGTTGCTTCCTATTGGCTACCTCTCCACAGGGA 
GAAATCGCTGATTTAACAGCAGTCAATATCCCAAATCTGGAACAGGGAAC 
AGGGAAGCATTTAAAAATTGGAGAATTTAGGCCGGGCACAGTGGCTCATG 
CCTGTAATCTCAGCACTTTGGGAGGTCGACGTGGATGGATCACTTAGGAG 
TTCGAGACCAAGCCTGGGCAACATGGCGAAACCTCATCTCTACAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAACCCAAAAATTAGCCGGGCATGGTA 
GTGCACACCTGTGAGCCCCAGCTACTCAGGAGGCTGAGGTGGCAAGACTG 
CTTGAGCCCTGAGGTCGAGGCTGCAGTGAGCCGAGATCACACCACTGCAC 
TTCAGCCTGGGCAACAGAGTGAGACCTTGTCCCAGATAAATAAATTAAAT 
TAATTTAATTAGAGGATTTAAGGATTTTCCCTACAGACACCTCCTTATTT 
TCTCTGGCCTTTTCTGACTACTCTCCCTAACTCCCTGCTCCTCTGGTCTC 
CCAAAACTACTCCAGAAAAAAAAAAGGGGGGGAGGGACTAAAGGAAAGCC 
AGGTGACAGTGCCAGTGTGACAGATGACAAAGCATCTGCCCGAACAAACC 
GTAGGTCCCTGAACTTTCTCCAAGACCTGTCTGTGGACTTACCTATGAAA 
ACCAGTTTTAGCAAAAACCCTCCTAAGCCAGTTTATCAAGATCCCCTTAT 
CCTCAATATCCATCTGATTGGATTCTTCATCCCCCACCATTCCCCAGTGA 
TGTCACCAGGCCTTTCTTCAGCAACAGTAGTTAGTGGGTGTAGCCAGGAC 
GCCCCCTCACCCCTGATATGCCCTTTTAGTAATTCTTCATCCACAGGTTC 
CCACCCTGCTCCTAGGCTATACATTCCCATTTGCCCATGCTGCATTCGGA 
ATTGAGCCCAGTTCTATACTGAGGTCTTACTTCACCTCTCGCCATAGTCC 
TGAATAAAATTGGTTTTCACATTTAAAAACTGTCCAGCTCTGGTTGTTCC 
TTGACACAGGGTAATTTTTATTCCATGTGATAGTTTGCCTTACCTCAGCC 
TACACCCCTCAAACCTGCAACTCTATATTCAAGAACCAGACAGCCCTTTC 
CAACAGATAGGAAGAGGCTGCCCTGGTGCAAAGGAAGAGGCTCTGGGAGG 
AAGGAGAGAACCCGAAGGCTGCCCCCTCCTCTAGACTGAGCTCTGGGATG 
GGTGGACGATAAAACCCAGATACGTTTAGACATCTGAGCGTGGAGAGGAC 
TTTGCTTTGCTTCCACAGGGACCCCAAGGAAACTGCAAGCCCTCCAGAGA 
CTAAAAACAGCAGAACAGCAAGAAATGGCAGCAAAGGTCTGGGCAGAATC 
ATCCTATGTGGGCACAGACACAAACAGAGTCCCCTGTGGCCCCAGGAGAG 
TTTAAAGAAGATCCAGAGGCTGTCCTATTCCATATCTCAGCAGAGACAGG 
CCCGTGAGCCTAAAAGCTGATCATTAGGACAAGAAGGACACGAACTGTCC 
ZGTGAACCGCGTGGAACAAGGCCAATCACCAGACACCAGACCAGC 
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CAGACACAGCCCCGCAGTTCCCCAAGACCACCACGGACCCATCGCCCCTC 

ACCAATAGCTCCAGGCTACATAGACCCCCTCCACTTCATGGATGTCCTCA 

GAGCAGAAAGGGGAGGCAGGAGTGGAACCCTGACTTGGTTCAGTTGAAAC 

ATAAAATGACTGTACTATTATTGAATTGCTGAAGTTTACGTGAAAGAAAT 

GAGATTTAGTTTTTGGCCACAGTGCAAAATAAGAAACGAGGCTTCAACTG 

AGATT AAGGTGAGTT ATAGGAAAATGT ACTC CCTTGAAGGAC CTGTGAAG 

TGTGTTCGCTATGAGAAAATGACCAGAATCCACGTTCTTAGCTGCGGGAC 

TCAGGCTGACTCCTGTTTCTGGAGCTTGCACAAAGGGCAGGGAAATCCCT 

GTTTCAGGCACAGTGATTTCAATGTTTAAAAGAAAACAGGTGGGCCCTGG 

CAATCATGATAACATGTCATAAGTTTACATCTCTGTGAGGCAGGTAGTGT 

AATCCCCATTTTGCAAAGGAGGAAACCGAGGCTGAAAGCAGCTACATGGT 

CTCTTCAATGTGGCCCAAATGTTGGAGAACAGAGCTTAACTGAATCAGCA 

ATTCTATACTTAGAACTGACTCTCTCTTTATTATATCTCACTACTACCTT 

GATATTTGAAATATTCAACTTTTTTCAATCAAAAAATAACAATAATTTAG 

GCATAATGACTACTATGTCATTTAATTTCTTGCTGATATTTCAATATCCC 

ATGCCAGGAATATTGAAAGCTCAGCTCCTTAAGAGCTGACTATGGCATCA 

ACTCCCAACAACCATCCTTCCAGAAATATTTTCCCCTTTCTTTTGTTATA 

GAGTGGCACTGCCCTATATGGTGACCACTTGCCACATGTGGCTGTTGAAC 

ACTTGAAATTGGCTTGTCAGAATTGCAGTGTAAAGTGTAAAACACATACC 

AAATTTCAAAGACATGGCACATAATAAAAAATGTAAAATATCTCATTAAC 

AATTTTTATATTGACTGTGTAAGTAACATTTTGAATATATTGGATT'AAAT 

ACATGGATGATGCCCCAACACCCACAGTCCCTTATCAAGTCTCTACTTCA 

CATTTTTGTACTTCTGACTTAGAAATAGCACTGGCGTCTAAGAGCCTATT 

AATGTCGTCAATAGGTTCTTGGGAACCACAATTTTAAACAAAATGACATA 

TAAGAAAACGAATAACATTGAACAAAATGACATTATTCGAGGACCTGCTG 

CATGTTGTTTCACTTAAAGTCAGTGTCCAAGAACCTATCAGTGACATTTA 

GTGAGGACTTGCTGTCCTTCCTGTTTACAGGAACCTGGGCAAGTTACTTA 

ATTCCTCTAAGCCTGGTTTATATCCCTGCAAAGAGAGAAGGATAATAATC 

ACCAGTACTTAGTGATGTCGTAAGGAGAAAATAAAATAATAAATATGAAA 

TGGCTGACAGTGTCCTTGTCACACAGAAGATGTGTGATCCACAGTAGCTG 

CTATTGTCTGCCTCACTTCACTAGTAATGGTCCAGGGAGGCCTTTAATGT 

GCATGGTGCAGTACATTCACATGTTGGACATGGGTGAAGGGAAAGACCAG 

GCTCATCTAAACACAATAGGATGCTTGTGGTGTTTTGAGGAGGAATCAAG 

GACTAGTTATCCACAGCTGTAACATGCATGGATCAAAAGAGATAAGGCAC 

ACAAAAGACTTTGTCAGTAGCAAAGCATTACAAAATGCAGAGACCAGCTG 

TGGGTGGTGGTGAGTCAGACCCAGCTTCCCTCTGTGCCTGGCTGAGTGGT 

TCTGGGCAAGTCACGCCATCTGTCTTGATGCCCTTCCCCATCTATAGAGA 

GGGAGCAACTGAGGCCCCTTCCAATACTGAAGTCCTTTATTTCTGCTACT 

TTAGAAATATCCACATTTTTGGTAAATTCAAATGATCCAATGATTCCATT 

TCCTAATGTTCAAAACTAGCCCCAGAAACATCTAAATGAATCAAACAAAT 

AAAATATTTATTGTGTATGTTTTGATTGCTGAAACTTCTATTTTAGCAAC 

ACACACACACAC2ACACAGAACCCATAAGCCTTCATCTTTCCTTGGATAAA 

CGAGCCTTCCTGTCTGGCCATTTAAGTCACGATTAAGTAAATGATTTCCA 

ACTCGCCTTTTGCAGCAGTTCAGATGGGTCTTTCCTGCGTGGCAGTGGCC 

CTCCTGACTTATGATTTCCTGTGTGTCGGCCTGTTACCACTGCAGCTTAA 

CTGAGGAAACAAGAACAAAACAGCTTCTGACCCCAAGAGACTGTTGGAGG 

CAAAGGCTTCAGTCCCAAGAACCTCACACGTGGGGAGCCCGAGAGCCCAG 

C CCTGACCTTTTCTCCAGTAATAACATAAGAAACAACAGGCACTGGC CTT 

ATTTTGGATACAAAGAGTGGTGCTTTTCCTTAAATCTTCCTTTAGTCAGG 

GCTACCCCTTCATGGACGCCCCAACATCCATGGTTCCTGCTTGAGTCCCT 

GCTTCCATATTCCTGCACTTCTCACTTGAAATATCCCTGGAGTACGTTAA 

GCAGCCAGGTTTGGAAGTTCTTGCTGTGCAGGCGGGTGTGTGCATGTCCT 

CTCTCTCAACAGGACACAAGCTCCCCAAATCAGACGGTATGCCTCCACGC 

CCCTTCCCAAGCCTCCCCAGCAGCACCGAGCATGTGAGGGGAGCTGGGGC 

CCAGGCCATGATGGGAAGCACTCTCTGCCTAAAGACTAGGGTGATGCGCC 

CTCAACTGTGGGAATGAGCCCCAGCTCTGGTGTCTGCCTCGGTTTTTCCT 

CCTGGACAATCAACATGAACTCCTCACCCCTCTTATCCACTTTGCATAAA 

CTGAAAATAACAAACCCAGGGCTCTTTCTGTCACAGGAAAGGGTTTTTTT 

TTAT AAAATTAAACAG AG ATGATT CAACACAC C CAGGAT ATAACACATGG 

GCCATGAATCAAGGGCAGCATTGCTCTGGTCAGCCTGTTGTTTGGGCCCC 
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CTTGGCAGGGCTCTCCCC.rGAATCTTCCCCTCTTGACTCCCATCANC^'^ 

GCACTCCANCTTTGTGTTACAGGCGATAAATGGGAAAGGGGTAAAT 

>Concig48 

CATTCTTAATTAGAGAAACGCTCATTAAACTAGACACCCAAATTCTCTGG 

GGGGGGATCATTCTTACAAGCATGCCCTTCTCTCTTAAAGAGAGAGCACT 

TTTTTCGCAAATAATGCTGCCATGAACATACGGGGTGCATGTATCTTCGT 

AAJAGAATGATTTCTATTTTGGGGGGTATGTACCCAGCAATAGGATTGCT 

GGGTCAAATGGTATTTCTGGTTCTAGATCTTCGAGATCTTCCACACCGTC 

TTCCACAATGGTTGAACTAATTCACATTCCTACCAACAGTGTGAAAGCAT 

TCCTATTTCTCTGCAACCTCGCCAGCACCTGTTATTTCTTGACTTTTTAA 

TAATCGTCATTCTGACTAGCATGAGAGACAGTATCTCGTTGAGGATTTGA 

TGTGCATTTTGCTAATGATCAGTGATGTTGAGCTTTTTTTCATATGTTTT 

TTGGCTGCAAGAATGTCTTCTTTTGAGAAGTGTCTGTTCATGTCCTTTGC 

CCACTTTTTAATGGGGGTTTGTTTTTTCTTGTAAATTTGTTTAAGCTCCT 

TATAGACTCACAATAACAAAGACATGGGATCAACCTAAATGTCCATCAAT 

GATATAACGGATAAAGAAAATGTGGTACATATATACCATGGAATAGTATG 

CAGCCATAAAAAAGAATGGGATCATATCCTTTGAAAGGACATGGATGAGC 

TGGAAACCATGATCCTCAGCAAACTATGCAAGAACAGAAAACAATTGTTG 

CATGCTCTCACTTATAAGTGGGAGCTGAACACTGAGAACACAGGGACACA 

GAGAGGGGAACAACACACATTTGGGGCCTGTCAGGGGTGAGGTGGGGGAG 

GGAGAGCATTAGGAAAAATAGCTAATGCATGCTGGGCTTAATACCTAGGT 

GATGGGTTGACAGGTGCAGCAAATCACTGTGGCACACATTTACCTATGTA 

ACAAACCTGCACATCCTGCACACGTACCCCAGGACTTCAAAATAAAGAGA 

GACAATACTTCTCCCTTAAGTGTCTACTGTTGCTTTGCAATAAAAAGTTC 

CTGCCTTTCACTTCACTCTGACTTGTCCCTGAATTCTTTCTCGTGATGGT 

GTCAAGAACGTGGACACTGGCTGGGGCTGGAGACTCACCAGCATCCGGAG 

ACCCTCCTGAGCCCTCCAGCAATACAACTTTGACACAAACTATGAAATCA 

CAGATCCAAGAAGCTCAAAGAACCCAAGCACAGGAAACATGATGAAACTA 

CATGAAGGAACATCAGAATTGAATTGTTCAAAATCAGTGATAAAGAGTAA 

ATCTTAAAAGCAACCAGAACAAAATATCCATCATATACGCAGAAATAAAG 

ATAAGTATGACAGCAGATTTACAAATAGAAAAAAAAACAAGTGCAGCAAC 

AGAAACAAACTATCAATCCATAATTCTATACCTAGTGAAAATTTCTTTCA 

AAACAAAGGTGAAATAAAAAAATTATTTTCAGGAATACAAAAGCGAAAAA 

ATTAATCACTAGCATTCATCACTGCAAGAAATGTTAAAGGAAGTCCTTTA 

GGCAGAAAGAAAATGATACAAGGTGAATATTTGGATCCCTGCAAGGAACT 

AAAAAGATCCAGAACTGATAACTTAATGGGTAAACATGTAATTTTCATCA 

ACAAGTGAATGAATAAACAAATCATGATATATCCATATGATAGACTACTA 

CTTAGAATACAAAAGAAGAACTACTTATGCATGTGATAACATGAATGATA 

TT CAAAATTATTATTGAGTGAAAGACAC CAGATCAAAACAAAGT ACATAC 

TGTATGATTCTGTTTATATAAAACTCTATAAATTGCATGCTCTTCTATAG 

TGACAGAAAGAAGATCAGTGGCTGCCTGCAGACAGGAAGAGATTACAAAC 

GGAAATGAGAATTCCTTAAGAGATGATGGACATGCTCATTACCCATCATA 

TGTATACAGCCATAATGGTTTTACAGATACATATATATGTACACGCCAAC 

ATAAATATAAGTTATCAAATTACAGTAAGTTCTGACTTAATGTCACTAGG 

TTCCTGGAAACTTTGACTTTAAGCAAAATGAXGTACAGTGAAACCAATTT 

TACCATAGGCTAATTGATATAAAGATGAGTTAGGTTTTTGGTTTTTTTTT 

TTTTGACATGAAGTCTCGCTCTATCGCCCAGGCAGGAGAAGAAGAGTTAG 

GTTTTACAGCATGTTTCTGGTCACAAGAACATCATCAAACTTGTAAATAA 

AGGCACAAAACACTTCTAATATTAAATATCAAAATAAATATGAGTTATAC 

AGAATTTAAGAAAGATTAATAAAAACAAGTAAAATCATTATTTATGGGAT 

TTTTGGTAATCAGTGAGTTATGTGGTCATAGTGGAAGTGGGTTAAGTCAA 

GAAATAAATGTTTGCAAAACAAAAATTTTAAAGATCCTCTCCTACCACCA 

CACAAAAAACAAGAAAACACGGTGGGCTCGCTAAGCACTTTTGTACCACT 

CGTATCTTATGCGTTTGTATGATTATTGTAAATGCTTTATGATAATTTTT 

AGAGACAGGGTCTCACTCTGTGTCTCAGGCTGGAGTGAAGTGGTGCAATC 

ATAGCTCACTGCAGTCTCAACCTCCCGGATTCAAGAGATCCTCCCACCTC 

AGCCTCCAGTGTAGCTAGGACTACAGTTGTGTGCCACCATGCCCATCTAT 

CTTCTTTTTTATTTTTTGTAGAGACAGGGGTTGTGCTTTGTTGCCCAGGC 

TAGTCTTCAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAA 

ATGCTGGGATTTCGGACATGAGCCAGCAGCACCTTGCCCAGCATTTTATT 
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TCATAATAATTATAAGTCATTCCTTCATTCATCTTACAACCCACTTGTTC 

CAGTTCAGGATCTCGGGTGACCAGAACCTATTAACGTTCACGCACAAGTC 

AGAAACCAGCCCTGGACAGGACACCATCCTACCGCAGGGAGAACTTACAC 

ACCCACACTCACTCAGACTGGGACCATGCAAAGAACCTAACGTGCACTTT 

GGAATGTGTGTTCCATACCCACTAGAACAGCTAAAATTTAAAAGACTGAC 

CATACTTGAGTGTTGAACAGGATGTGACACAACTAAATCTTTTAAGCGCT 

TC^GCGTAAATGGCACAGCCGCTTTGGAAAACAGTTGGCAGTTTTTCAAG 

TTAAATATACCCAAACTCTATGATCCACTTCTCAACAATCAAACAAGAGA 

AATAAAAGCAATGTCTACACAAAGATGTATACACAAATGTTCATTGCAGC 

CTTAATTATACTAGCCCCAAGTTGAAACAAGCCAAATGTCCATTACCAGA 

TGACTGGAACATACAAATTGTGGTATATTGATACAATGAAATACTACTTA 

GTAATAAAAAAGAAAGAGCTATTAACATAAGCAACAACATGGATGAATCT 

GAAAACAATTATGCTAAGTGAAAACAGCCACACAAAAGTTACATACTGTA 

TGATCACATCTACATAAAATTACAGAAAAGGCAAACTAATCTATAGACAG 

AAAAGCAGATGAGTGGTTACCTAGGGATGGGGCAGAAGGGACGAAAGGAT 

GGATTGCAAAATAGCACAAAAATATTGGAGGGATGACAAATATATTCATT 

ATCTTGATTGTGGGGATAGTTTAATGGGTATATATAGAGATCAAAGCTCA 

TCTAATTATACACTTTAAATATATGTATTTCATTGTGCATCAGTTATTCA 

TCAACAAGACTATAAAATAATATATGCCTACATACATTTTTAAATATTCA 

AAATCTCACAGTTATATACATAAATGCAACTGAATATGTATTCAGATGTT 

TTAACAAGCAGAAAGGACTGATTAAACTCATGACAGCGGCTGTTTCTGGG 

AAGGGTGTAGGAGACAAGAGATGGAAAAGAGGATGAGAGCCAGAAGAGAC 

CCTTGTAATGTTTCCTTTCTTTTAGTAAAAATATATTGACAGTTAAAGCT 

GAGAGGTGAGAATAATAGTCTCATGGCTTTTGTGTCCTTAAAATTTCACA 

AACTAAGTGAAATGGGAGAAAGCAAAAAAATAAACTTAAATAAATGTTAT 

ATTGCCCAAAAAGAGATTTAAAATGGAGGTTAGACACATGAGACTTACGT 

TCTCAAAAAAGTAGAATCTGCAGGGAAGTTTAACAACTATAAAGAATTAA 

AATCTAGCTTCTACCAGCCCAAAGCCTAAAATGTTCTGCTTTATTCTTCC 

TTATTATAATTCATAGGTAATATATTTTATGTTTGCAAATGAATGCAGTG 

ATATTAGATCTCTAAGAGGTGCTAAAAATGAAAAGTAGATATTCCAATTT 

TTCCCAATTTTCCTTCTCTTTCCATGAATGAAAAATATACATATTTGATG 

ATTTCCAAGTTTATACAACCGATCTTTCTCTTAGTTTTCTCTTACCAAAT 

TCCCTCCCTCACTCAGCCACCAGCCAGTCCAACTGTGCTACCTGCACAGC 

AGCCCTCATACCATCCACACTCTCATCAGGATCCTGCCTGACCTGCGAGG 

AGCAGCAGCAAGAAGGAGACAGAACCTCCACGCTGAGCATCTCAGGGCTT 

TCTCAGAGACTCCAGAGGACCCTGATAGGGACAGAGCCTGGCCAGCAATC 

CATGCTGCCAGCTGTATGATTGTGGGCATGTAAATTCTCAACTGAAAATG 

GGTGTAATAATAACATGTTCTTCCCAGAATGAGCTTTATGAAGATCATAT 

AGCTGTTTGGAACTCAGACAAGCACTGGTAGGAATACAAACAGGGGAGCC 

AACAGCCTATAAATAATACTTTAAGAAAGGGCATGAATGTAATTACTTAG 

GAACAAAAGGCAAAGTGGAGAGATGCCTAGGACTGAGCTGGACAAGCTGC 

ACCCTTTAGTGGCTCAGCCCATGGGCTGACAAGGAAAATGGAGGAGCTAC 

CAAAGAAGGTGGAAGGATTCTGGGAGAGTGGCCCTCACCCTGCCCAGGGC 

AGGGCTCAGTGGGAGAGAGGGAGATCTGTTATAAATGCTGCCAGGAGGTC 

GAGTCATGTGAGAATGTCCATGTGAAAACATCCACTGTGTGTATCTAAAG 

AGAGTGGCTGTAAAACAGGTCAGGGTCAAAGGTCTTATTGTCTCAGATGT 

TATCTGCATGCATTGTCTCACGACCAAGAAAACTAAGGAGCATGGACACA 

AAGGGTTAGGTTGAAGCAAAAATTTAATAAGTGAAAGAAGAAGGCTCTCT 

GCAGTGGAGAGGGGAGTCTGAGTGGGTTGCCACTTTGACAGCTGAATCCA 

AAAGCTTTTATAAGAAACTCTTCTCATATCTGCAGCTGTTTGAGTAACTT 

CTCTTACCTATAAAACTGTCTGTATAACTCTCCCTTATCTATGCAGCTGT 

GGGATGTCTCCAGGTAAGCATAAAGTGTAGCTTCTCTTGTTTGTATAACT 

GTGGGTTTGTTTTAGGCAAGCCCCCATCCCCTCCCTGTGTAAGCTCCCAT 

GGAGCCCACCATGTGCATATCTGAGAAGTGGAGGAAGCTTTCTCTGGGAG 

CTCACTGATCGTACAAAGAACAAGAGGCTTCTGTGCCGCTTATCTATTCA 

GGTGCAGCCTGAGTTTTCCCCAGGCTGCTCTATTTTTGCCTGTAGCTATG 

ATTTTTCAGGCAGGCTGCTTCTCTGAAGACTAGCCTTAACTGTCTACCTA 

TCAGATTTTTCCTTTTCTTCTCCCTCAGCTGGTTCCCCTCACCAAGGCTG 

AGCAAGTGAAAAGGAGGGCACAGGGCAGGCCAGTAGTGAGCAGCAACAAG 

GAACTAAGACAGCAGAAACCACTCTTCACACCTGGGTTGAAAGGGGTGGG 
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GAGCCAGGACTACAGC1 \-AGGTAAGAACATAGGTAAAGAGATACTGTTGT 

TGTGTTGTTTTTAACTATGAGAAGCATTGAGCTTTAAATTTCTACAGGAA 

GGATCCAGTTCAGACAGGAGCACCCAATATTCAGAAGAGAAGAACATGGT 

GTAAAGGTCCTGGGAAGGCTGAGAGGATTGGGACTCAGAATCCAGAGCAG 

AAGCCGTCTGTGAACAGAAGAAGGACCTCCCCCAGTGTAGCAAGAGGGAG 

GGAGGAGGGACAGATGCCAAGATGGTTCAGGAAGAAGGTTTGGTGGTAAA 

TGTGAGGCTGTGCTCACCTGCTGGCTTCAATTTTCTCTTTAAAATGTCAG 

ATGGAATCATTTGATGAAGGCCATGCCATGCAATGAAATGGCAGTCTGAG 

GCATGGAGCAGCTCCAGCTTAGCCCGTGTTTAGGGTAATTATGGCTCCAA 

CCCAGGAGATGAATATGACTAGGGAAAGTGAAGTCCAAAAACAAATGGTC 

TCAAGTTGACTGTGAGTCTTCTGGGAGGCTGAGACGACAGGTGGGGTTGA 

CAAGGGAAGGGGAACCCACCTGCTGAAAAACATCAGGCTGTTGGCTGGGG 

GAGGGGTGAGGC CTGTGTTGTAGAGATGGATGGATGC CTAAAGTTGGGTA 

AAGGTTTCAACTCTACCCTCTGCTGGGTGTGGAAATAAACAAAGACCACC 

CAAATGAGAACAAACAAAGACTATTTATCCAGAGCTTGCTCTGACAAGGG 

AGTCGGCAACCATCACTTGCTTGGCAGAGACTCAGAAGTAAGCAGGGGAG 

AAAGCCTCATAGCAGAAAGAAGGGAAGTCTTCATGTATGCCCTGAGTGGC 

AGCTGTAGATGTGGGTGAGTTGCAGGTGGCTAACTAGAAATGGGGGACTC 

CTGTGTGATTGATTAGGAGCATGTTTGGCTTTCTCTGGTTGGTCCTACAT 

TGGAAGAGGGAACAAAAAATTTAGGGCAGTTGTCAGTTATTAATCAAGTG 

TTGGCCATTTTTGACTGACTGTTACAGGAGTGACTGGCTCCCTGGATTGT 

TTGCTAGAAATAGTGGTCTTCACTTCCTGCAAGTCTGACTTTCTGGTAAT 

AGGCTTCCTGGGTTGGCTATTGTGGATAATAAGTGGGTTTCCTGAGCTGA 

TTTCTGCAGATTGTGGATCAGAGTTATTTTATATAAACAGTCTGACCATT 

TTCCACTGGCATATTCCATCTTCCAAGAGCTGGCCAAGCTGCTGTCTTAT 

CTGTCTCCCCCAGCCCCTCCACTCTGGCTGTGAAAATACAAGCCACTAGG 

TGAGGAATGGGGACAATTGAAGACTGAAAGCTTTTCTTTGCTGGGTTCGC 

AGAGCTGAGGAAAGAAATGACAACATCCAAGTGTCTGCCCTGGGCCAGTT 

TTAGGACTGTAGTGGTAATGCAAGGACTGTGTGAGTTTATATTTTCATTT 

GTCTCTCTAACTAAGGTGGAAAAAAAAAAACAGAAAATTGTCTGTCTGCA 

GTCTCTGCAAAAGTCTAACACTGTGCTTCCCAACATTGCAGCCATTAGCC 

ACAGGTGAGTATCAAGCACTTTAAATGAGACTGGTCCAAACTGAGATGTG 

CTCTGAGAATAAAACACACAGCAGATTTCAAAGACCTAGTACATGCCCTG 

ATTTCAAGCTATATTACAAAGCTGTGGTAATCAAAACAGTATGGCATTGG 

GAAAAAAATAGACACATTGGTCAATGTGACAGAATAGAGAGCCCAGAAAT 

AAACCCGTGCATGTATAGTCAACTAATCTTTGACAAGAGTACCAAGAATA 

CACAATGGGGAAAGTCTCTTCAATAAGTGGTGTTGGGAAAACTAGATATC 

CACATGCAAAAGAAAGAAATTAGACCCTTGTATTACACAAAATCTAAAAT 

TAATTCAAAATAGAAAAAGACTTACATGTAAGATCTAAAACCATAAAACT 

CCTAGAAGAAAACATAGGGAAAGAGCTCCTTGACACTGGCATTAGCAGTA 

ATTTTTCAGATATAACATCAAAAGTACAGGCAATGAAAGCAAAAACAAGT 

GAGAGTATATCAAACTAAAAAGTTTCTGCACAGCATAAACAATCAACAGA 

GTAAAGACATGACGTATGGAATGAGAGAAAATATTGACATCTGACAAAGG 

GTTAATATCCAAAATATATAAGTAATTCACACAACTCAGTAACAAAAGCC 

AAATAACCTGACTTTTTTTTAAAATGGGCAAAGTACCTGAATAGGTATTC 

CTCAAAAGAAGACATACAAATGGCCAAGAGATGTATGAAAAGCTGCTTAA 

CATAACTAATCATCAGAGAAATACACAAATCAAAACAAGATATCATCTCA 

CACCTGTTAGAATGGCTATTATTAAAAAATGAGATAAGTGTTGGCCAGGT 

GTGGAGGAAAGGAAACCCTTGTACATTATTCATAGGAATGTAAATTAGTA 

CAGCCATTATGGAGAACAGTATGGAGATTCCCTAACAAAATTAAAAATAG 

AATTACCATATGACCCAGCAATTCCACTTCAAGGAATACATTCAAATACT 

ATCAGTATCTCAATAAGATACTTGCACTCCTATGTTCGTTGCAGCGTTAT 

T CAC CAT AGC CAAGATACAGAAACAAGTTAAATGT C CAT CAACAGAT AAA 

TGGATAAAGAAAATCAGGTACATATATATATACAATGGAATATTATTCAG 

CAAAATCCTGACATCTGAGATAACCTGGATAAACCTGGAGGACATTATGC 

TAAGTAAAATCAAAGCCTGACACAGAAAGACAAATACCACATAATCTCAC 

TTACATATGAAATATGAAAATGTTAATTTTATGGAAACAGAGTAGAATGG 

TAGTTGCCAGAGCCTGAGAGTAGAGAAAATGAGATGCTTGTCAAATCAAA 

TCATCACATTGAATATATATAATCTATTTGTCAATTAAATATTTTAAGAA 

TAAAAAATACCTGGCACCAAAAAAAGAATGCAAAATGTCTCAACAATGTT 
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ATATGTATTGCATTTTGi-AGTGATAATAATTTGAATATTAGGTTAAATAA 

AATATATTTGAAAAATTAACTTCACCTATTTCTTTCCATTTTTGTTAACA 

TAGGTACAAAAAAAAATTAAAATTACCTATGTGGCTCATGTAGGTGGCTC 

ACATTATACTTTGATGACACTATACAGGCTGGTGACCATATATCTCTTAG 

ACTAGTCTAAGTGATTTAACAGTGGTTCCAGAAAGATCCAGGTTTAACAC 

CAATGAAAGGGCCAGCTGGCTTAGCCCAGCTTGTGTGGGAAATGTTGGGG 

AGTCGGTTTAAGACAGGGAAAAGCAAAACTTTTGATGCTATTGACTTTTTG 

AAAAATCTTTTGTGGCTGAAAAAACCAAAACATTATT 

>Concig4 9 

GCTCGAGTGTGTCTCTAAAGCCTTTCCCCCATTGGCTCCACTATACGCAC 

TCTCCTGGTTTCCTCCCCTCTAGCCGCTGTCTTTGGTCTCCTTTCTGATT 

TTGCTGCGTCCTCTGTCCCCTGAATGATTGCTTCTCCACTACGGGGTGAT 

TTTGCTCCCCAGGGGACATTTGGCAATATCTGGAGAGGTCTATGGTTGTG 

TTTGAGGGTGTTGCTACTGCCATCTAGTGGGGAGAGGCTAAAGATGCTGT 

TAATGCCCAGGACAGTCCCCATAACACAGAATTATTCAGCTCAAAATATC 

CATGGTGCCAAGATCAAGAAACCCTGCTCAAATATTAGCATGTGCTGAAG 

GCCCTTCTCTTTCCTTTAGCAATATCTGCCTCCTTAGGGATCTTTTCTAG 

TCTCAGTGGTTTAACATTTAAAATCCCAAATTAGGCAATAAATTGGGCCC 

CAAACTTCGTTAGTATAAAATGTAGAACTGTGTTATTAGAAGGCTAATAA 

AATGACCTGGTGAGCATCTGCAGCTAGCCTCTGAGCAATTCTGGGGACCA 

CGTGCAAGATAAATCCATCTGTTCCCTCTCTGTAATGTGGCGCTACCTTG 

TGGCCGATTTTTCCTCGGGTTAAATATCTCTGGGGATGCAACTTGTCGTG 

GTTAATGGCTGTGTGAGGCCAGCGCGTGGTGATAAAGGAATCAATCAAGA 

CAATATTGAATTTAGAAAGGCAGATTTATTTAGAGAAAAGGAGAGATACG 

TTGCAAGGGAGCAATGGGCAATACAGCAGAGGGAAGGCTGTCTGCAAAGA 

GGCAAGGGCTACGTATGACGTAGGGCTGCTTAGGCTGAATGCTTGCAGAC 

AAGATGCTTGCGTGCAGGTGGGCTGTGAGCTGAGTGCTTGGGTGCTAGTG 

AGCCATTGGCAGCTGACCCTATTTCTTGGAACATTCGCTCCCTGCAAGCA 

TTTTAATGTTAAACCGCCAGGTCAGTTTGAATTTTCTTTTTTCTTTTTTT 

TTTTTTTTTTTTGCCTTTAGTAGGACCTGCCGTTGTGAGACTATCTGAGG 

TAAATTAGACACCCTCCTGGTTTAAGTCACCGCTCCAGTGACTAGGCAGG 

GAGCTCTTCCTTGAAGAGGGTGTGGGCAGTGGGTACTTTGCATGTTGTCC 

ACACCAGGCGAGCTGCTGCTTCAGGGCCTTTGCATTTGCTCTTTTCTTTG 

CCCAAAATGCACTTCTCTCACTGTTCACATGATTTTTCTCCCTCTTTTCC 

TTTTAGTCTTTGCTTAAATATCACCTTCTAGGGAGGCCTTCCCACACCAC 

CTCTTCAAGATTTGAGGGTATGCACCCCCACCCCTAGCCTTCTTATCCCT 

CTCCACTGCTTTCTTCTCAAAGCACTTGTTACGTTCAAATAAAATAGATT 

AGTTACTTTATAGTTCTAATTTTACTATTTTTTGTTTACTTCATCAATAC 

CCATGTAATCTCTGGAAGGAACGTTTCTTTTTGTAGTGTATTTCTAGCAC 

CTAGAACAGTACTTGGCACATGGCAGGTGTTCAAAAGTATTTGTTGATTA 

TTTTCTCAAAGGGCATGGAGTCTTAGAAGTTTGAGAACACAGTTCTAAGC 

ACAGCTGTTTAGAGACTATGGATGATGCTAATGGCTGTATTCCCAGTAGG 

TGGGGCAATTCTCAAATTGACCTGGAATCCTTGAGATCTGGGGACAGTCA 

CCAAGCACTGGGCTCTGTGGGGAGAGATGTGCTGGTTTTTAGAGAGGAGA 

ATAGCATCCTGGGGGACTTGGCCCCAGGGCTTTCCTGTCCCAATCTCTTC 

CCAACTGAGTCCCAGAGGCAGGAGGCCTTGTCTGTAGCTGGTCAGTCCTG 

TAACTGTTTCCCTCCCATCTACACAGATGCAAAGAAGGCTGAGAAAAGCA 

AGCTGTCAGGTGAGCAGGGGCCCTGACTCCTCCCCAGAAGGCACTCAGAA 

CTTCCATAGGGCAACTGGAAAGAAGGTTCTACTTCCTCACCGGCAGCTGT 

TGCTGGGGAAAAAACCAGCCTCAGGCCCTACCCTGTGCTGAGAACCTGAA 

TCCAGTATCAGGTTCTCCAACAAACTTGGATCCAGCTGACCCTCACAAGG 

GGTCAGATGCAACCTTGTAGCATATGGAAAATGGCAGCAAGGTCCTTGTG 

TGGACTATGCCTAGAATCTAAATTAAGACAAGGCCTCAGAGGGGCTAAGT 

GACATCTGTCTCCAAAGTTTCACAGCTAGTGTGTGACTAAATCTTGATTC 

CACCCTCTCAGGTTTTACCATAATCCCAAAAAAGGTTGAAACAAGAAAAG 

TTATCTTTGGGCAATTACCTCTTTCTGTTCCTTGCTTTACCTACTAATGT 

TCTAGGCTCACCCTCTGGTCTGCAATCTCACTGAACTGACAGATCCCTCA 

TGGCCTAAAGGGTTTTCACACTGGGTTGACTAGGCTCTCCCATTGCCTGT 

CCTACTGTCTAAGGCACCTCCTGGGTAGGGTGCCCAGCGTCATTCTGATG 

CTGCCTGACTTTCCTTCCAGCTACTTTTGAAACTTGGTATCCATGGCAGA 
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GGCTTAAAGGGGATGTTC-JAGCT 

ATCAAGGAAATCAGCATCTCTGGATAGCTCTACTAAGGCTTAGTTCTCAT 

TGTCCAATCTAGCTCCTGGGTCATGGGAGGCATTCAGGAAATATTTGAGT 

GTAAGAGTGAGTTGCTTTACCTCCAGAATATCCTTCCAATGGCTCTGAAG 

CAGGCTGTGGAGTCCTGCTGGCTGATCACAGTTCACAGGTGGCTCCCAAA 

CCTGTGGTCTACATCCATCCTTTGTCAGTGTCACTGCCATTGTCCCACAA 

ATGTCATTTGGGCCTAGCCCCTGGGATAGTAATCAGTCTTTACATAGATA 

TACATTGTGCTTTACATCCACAGTAATTCTGAGTGGACCTTAAAATAAAT 

TCCATGTCAGGTCTCACCAGCCCATGGGTTACAGATGGGGTTACCTTTCA 

GCCTTGTAAGGTGCCCCGTCTTTGAGTGTAGACATGGACTCACAACGAGT 

CCACTCCTGCTGTTCCTCTGCTCTTGCTGAGGCTTCTGCTGCTGCTGCTG 

CTGCTTTGCAGAGGCTGGCCAGCTGTGGTGCCTGAGGCACCTGTGTCTTC 

ACAGCACCAACTTGCATGGTGGCCACGGTGTAGTTGGAAAGGGATGCTTA 

GATGGGAGGCCAATGGGAGCTGCTTCAGGAGGCAAATCCAAGTCACAGAG 

ATCGAGTCACCGAGAGCATAGTAAACTCAAAATCCCTTCTTCTGCTTAAT 

AACTGAGATGCTGTCACTGGGTTAACCTCACCAAGCCTTGTTXTGTCTTC 

ACTTAGAGTGATTTCTGTCTTAGAAGGCTCCTCATATCCTTCTGGGGAAG 

GCTTCTAGTGAGTCCACAGATAGCTGGACCAGGCATGTCCAGAAATAATC 

TGATTCTCACATTTGAGTTAGCCAGCGTTCCCAGCTATATCCCCATTTTG 

TGTCTATATAAGTTACCAAAGCCCACAAGGATATTAGGTGGCTCCTTAGT 

TTGCTTTATGATTATGCCTTGTGTGTGTGTGTGTGTGAGTGTGTACGCCT 

ATGAGGATTCCTTCTCTCCCGTTCTTGCTATGGCTTCTCTTCCCCACTGA 

TGGGCTGTAGTTCCCTGTCCTTTTGACTTTGGGCTTAGTCATGTGACTTT 

TTTGCCAAGGGAATGTGGGCAGAAGTAACTGGGAGCCAGTCCCAAGCTAA 

GGCCTTGGGAAGCATGGTGAGCCTATGCCAGCTCCCTCAGAACTCCTTCC 

CTTGGC CATGAAGAGAGAATAACCTGGATTGTAC CTT CAGC CCATGTCCT 

AGAATACAAACATGGAGAATAATGAACTTGACTCAAAGGCTGAAGGGCAG 

CTGAGCCCACATGAGGTCAATTGAACTGCAGCTACCTACAGACCTGAAAG 

TGAAATAAACATGTATAAGTCTCTGACGTTTGGGGTTTGTTTACATAGCA 

TTATTGTAGCAGAAACTTAAATAATACTGGGGGCTAAATATAGTGGACCA 

GTGACAGCACAGAATGGTAAAATGGAGTGATTGTTACTTACATCACAACC 

CTTCATCTCTGTTGATGGACACTAAAATCAAAGTGGCAATTACTCAGAGT 

TGGGAGTCATTGAGTTGCATCATTGTTGTTTAGAATCATTGACAGTTTGA 

GCTCTAAGTGATTACAGAGATGGTTTCCTCAGCTACAGGTAAATAAACAA 

AGGCACAGAGAAGTAAAGTGACTTCTAGAGGGCTTCATTGATATTTAGCA 

GCAGAATCAGAGCTAAACAATGAGTCTCTCATCTCCAGCCTTTCTATTCT 

TGTTTCCTAGGTTGGGATTTTGGGAAATAGTGCAGAGAGATTAGCAGTAG 

TGACATGGAACAATGTGAGCCTCAGCTTCCATCCCTGAGGCTGCCTTCAT 

CTGCCAGGGAAATGTCTCTGTGTGCAGCCTTGCCCTCTGCACACAGTGTG 

TATGGCCACCTGAATAAGTGTCCTTTCATAGCGACTAATGGATTGAAATG 

GGTGCTAGAGCAGTGCTTCTAAAAACTCCATGTATTAATCATCTAGGGGT 

CTTACCAAAAACGCATGCAGATTCTGATTCAGTAGGTCTGGAGTGGGGCT 

TGACATTCTGCACTTGTAACACATGGACCACACTTTGAGTAGCAATGTAT 

TAGATCATTCCAGTGGAAACATGTATGAGTGATGGAATGAACAGATATAA 

TTAATCCAGGTCTGGTAAGTGAGGTACTGATACATATTAAGTTGAAGTGA 

ATTTCACATCAAAAATAATGGTTACACAGTGACTTTTACTGCCCCCAAAT 

TCTTTCCTTTTGAGTGGTTTCAAAGTGAACTGAGCCAGCCAGGTTAAGTC 

CCTGGTTTAGTGTGTGATTAGAAGATTTGATCCAGCTTTCTCCTCCTTCT 

AATTCTTTAAATATGCAATGGCCTTCTAGAAACTTGTCTCTCAGGCTCCC 

CATGAGCCACCTGTCTTAATATCTTCCCCCCCAGGACATTTCCTGGGTCA 

AGGAAGGAATCAGGGACTAGGAAAAGTAGAAAGGTTGCCTGACAGTGAGA 

AACTTTTTGCACTCCTATTTGTTCAATTCTAAAATGTGGGTATTGTTGGG 

GCTTCTAATTGGAATCTAACCTGAAATTCAGGCATGTCTAGCTATATATG 

AC CAAGAATTAGGATGAGTTCACTAGAAGCCTATTTTCAGGAGAGCGGTC 

AGTTAAATTGAAGTTTATGGGTTTATGGTAATGGGTTGGGGAGTTTACTT 

CATTAGCAA^GCAACGTTTTTGAATCAGAGAAGTGATTTTGAACACACT 

GTACATAGTTTTCTCACTTAGATTTATCTCTGGGTCAACCCTTGTTGGAC 

CTATATTAGAATCATTTAGTGAAGAAAAGGTGGGTGTCATTAGGAAAAGA 

GCCATTTATTCAAATGTTCTGTTTGACATTAGGGCACTGGCAAGACTACA 

GAATCAATAGATATTTAAAAACAGCCAGGTGCGGTGGCTCACGCCTGTAA 
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■ : TCCCAGCGTGATTTGGG .rTACTTTGGGAGGCTGAAGCGGGTGGATTG _w 

TGAGCTCAGGAATTCAAGACCAGCCTGGTCAACACGGTGAAACCCTATCT 

CTACTAAAATACAAAAAATTAGCCGGGCATGGTGGCAGGCGCCTATAATC 

C C AG CTACTTGGGAGG CTGAGGGAGGAGAATCGCTTGAAC C CAGGAGGCG 

GATGTTGTCATGAGCTGAGATCGCGCCATTGCACTCAAGCCAGGGCAAGA 

ATAACAAGACTCTGTCTCACAACAAACAAGCGAACATACGAAACAAACGT 

AACATCCAAACTAGCAGGTACATGCCGTGCCAGTCATGACCCATGGTCAT 

AAGATGTCTACAGGTCAGGAAGCAGCTGCACAATGCCTGCATAGACAAAC 

TCTTATGAAAGCAGAATGTCCTGATGTCTCCATAACACATAACAGTGTAT 

GCTTTTATTATGGTCATACTCTAGCTGTGATGTACCTACGCTCTAATATG 

C C AAC G ATAGTTTTCTTTAAATCATCAACATAATAAATGTC ATGCTGTCA 

GTCCCCCACATGTAGACATAACTTAGCTGGTACATGGATAAGAAACCTAT 

ATTAGATAACCTTAGGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCA 

CTTTGGGGAGGCCGAAGCGGGTGGATCACGAGGTCAGGAGATCGAGACCA 

CCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAAA 

TTAACCGGGCATGGTGGCAGGCACCTGTGGTCCCAGCTACTCAGGAAGCT 

GAGGCGGGAGAATGGCGTGAACCCAGGAGGCGGAGGTTGCAGTAAGCCGA 

GATCACACCACTGCACTCCAGCCTGGGGGACAGAGCGCAAGATTTCGTCT 

C CCAACCC AAAAANCNANNNNAAATTTGCACCCAAATCTGACTAATTCCA 

GAGCCAATTCCAATTTAGAATCGTTATATCTCCCTGGTGAACTGAAGCTT 

TTATCTTTAAGGAGACACACTCTTTATGTCTACCAATGCTTATTGCCTTA 

AAGTCCACTTTGTCAGATACAGCTGCTTTCTTTTAATTAGTTTTTGTGTG 

GTATATCTCTTTCCATCCTTTTTCTTTCAGCCTTCTCCATTCTTACATTT 

TAGATATATTTCTTTTTTCTTTTTTTTTTGAGAGAGAGTCTCACTCTCTC 

GCCCAGGCTGGAGTAGTGCAATGGCGCGATCTTAGCTCACTGCAACCTCC 

ACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGG 

ATTACAGGAG C CCAC CAC CAAGC C C AGCTAATTTGTTGT ATTTTTAG AAG 

AGATGAGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCA 

GGTCATCCACCCACCTCGGCTTTCCCAAAGTGTTGGTATTACAGGCGCGA 

GCCACCATGCCCAGCTGATTTTAGCTGTATCTCAAAAACAGCATGGGTTC 

TGTTTGCTTTCCTTATTCAGCTTTATAATGTAAATCATTTACATCAAACA 

TCTAATACACCATGGACTGTAAAACACAGCCATATTTTATGTATGAATTA 

AAAAAAAAAACACCACCZAATTAGTTCCTGAGACACACACCTTAACAATAT 

CTCTGTGATGTGCATAAATCAATCACATCAGTTTCTCTGCACCTCAAAAT 

TTCTTTCCTCAATTCTCAGAGATATGGCAATTTCTCTGGTTTTACATTCC 

CAGAAGCAAAGAAAAAGTACACAGCTTCTTCAAGTCATGAGTAGCTTCTT 

TTTTATAGCTCTTGGTGTTTGCAAAAAAGATTGGAATTGCTTCACTAATA 

CTAAATTTTCATTCTGCTGCTCTGTTTCTATGACAAGTCAGAGGGCATCT 

TTTT GAAGACATTCTAAACAGCAATTAAACTCAAAAC ATGT AATG ACAAT 

G ACAC ACAAAACTCAACTG ATGACCAAATGAAGAGTTC CAGCCAAGTTGA 

CACAAGCTGGCTGACAGAGCTTGTAATACACACAGCTTGGCATATGCCTC 

GCCATTTCAGAGATGTAAAAATAGGAATAAATGTTTTCCCTTAAATCAAT 

GAAATAGAGCATTTGGACTGAAAATCTACGACAGTTATAGTGTTTTCTAT 

TCATTATTCTCATTCTGTTTCTTCTCCCCCTTGCTTTCTTTTAGTTTGAA 

TATTTTCTATCATTTCATTTTTCTTCCTACTAGTTTGAAACTTATGCATT 

TATTTTCTATTTTTTAGCACTTACCTAAAATTACTCTGTAATCCATGGAT 

CCTTAATTTATTTAAAAAACTAATGTTAATGAGTAGCTTTATTTTCCTCC 

CATCTAATTTAAGGCCCACAGAACACCTTCACTTACCTCAATCCTCTCCC 

AACTTACATGCTTTTAATGTCATATATGTTAATACCGTATACTTTTAAAA 

CTTTCTAAAATAGCATTATTTTATAGCATGAGTGTTCATTTACATTTTTG 

CATATATTTAGAATTTTCTTTGCTCTTCGTTTCTTCTTCTATTTATGACT 

CCCCTCTGGGATCATTTTCCTTCTACTTGAAGTACATAGTTTAGAACTGC 

ACTATTCAATACAGTAGCCACTAGCCATGTGTAGCTATTGAAGTTTAAAC 

TAAGTAAAATTGAGTAATATTAAAAACTCAGTTCCTTCATCTCACTAGCC 

ACATTTCAAGTGCTCAGCAGCCACATGTGACTAATGACTACTGTACAGCA 

AACATATAGAACATTTCCATCATGGCAAAGAGCTCTATTGATAGTGTTCA 

TCCAGAGTTTCTGTTCCAGGACCAAACTGAGGGTTGGGCTGCTATTTCTC 

ATGGCCCAATAACAAGATGCAGATGAGCTGGGGAGGAAGAGAGTTTTTAT 

TT CT G CAAC C AGTTACAGGG AG AAGG C CTGGAAAT CAT CAC CAGG C CAAC 

TCAAAATTATGACGTTTTCCAGAGCTTATATACCTTCTAAGCTATATGTC 
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TACGTGTAAGTGTGCAT1 JACCTGAAGACGTAAGTGATTAACTTCTTT1 a 

ATCTGTAACTAAGGTCTGAGTCCGGAAGATCTTCCCCTGGAGCCTCAGTA 

AATTTACTTAATCTAAATGGGTCCAGGTGCTGGGGTAATTACCCTTATCT 

TGTCCCCTGCTAAATCATGGAGGTTTGGGGAATTCCCTTTAGAGCACCAT 

TAACCTGTTTGTTGAAGGCCTGGGAATTTCCTCCAAACCCCCATTAAACC 

TGTTTAATCCCAAATTGGTTCCGTTAAAAATTCCCTCCTTAATTTGTCCA 

AT?TTAAAGGCCCAAAAAAGGCTGGGGCAAACTCCTGAATGGCCTTTGTT 

ACATTCCAACCTTTGTTTAAAAACACCGGTTTTTAATATTTAACTTAACC 

ATTTAATCTCTACTGAAACACTTGTTATATAAATCTGCATTAATGAGAAC 

TGGCCTGCGCCATATCTCCTTCTCAGAATATCTTAGGGTTGTGATCCCCT 

GTGTGAAGAGAATATATCTCTGGAGATCTCAATCTCTCTACCCCAAAAAA 

AATCTCACTCGGAGAAAACTCAGACTCTTATCTCCACAGCGCTATCTCTC 

TCCTCTCC 

>Concig50 

GCTTGTCTAAGATGGTGCTCCTTGTTGCTGTGCCTGCTTTCATCCTGGGA 
TCTCCCTTCACCATCAGGATTGCCTTCACCTCATTCCAGTCTTGGATCTT 

GTGGCCTCTTGGGAAAAGATGTGTAGGGAGAAAAATTTTCTTTAGAAACT 

TGCATATCTGACAATATATTTATCCTATCCTGACATTTGGTAGATAGTTC 

AGCTGGGTACAGAATTCTAATTAATTTTCCTTCCTGATTTATAAGACATT 

GCTCCATTTTCTTCTGGCTTCCAATATTGCTGCTGAGAAGTCTGACACCA 

TTCAAATGCCTGATTTTTTCCATGTGATTGTTGTTTTCTGTCTGGAGTGT 

TGTAGGATTGCCTCTTTATCTACAGTGTTCTGAAATTTCATGACGTAGGT 

CTTTCTTCATTCATTATGGTAGACACTCAGTGGGCCATTTAATCGGGAAA 

AACATGTGTTCTTCAAGTTCTACAAACTTTATTACTTCCTTTTTCTTGTG 

TCTTTCTCTGGTCTGTTTTCAGCCCCGAGTCTCTTAGATCTGTCCTCTAA 

TATTCCTATTGACTTTACTTCATTTTCTAAGTCTTTATCCTTTTGCTTTA 

CTTTCCGAGAGACCTGCTTAACCTTATCTCCCAACTCTTTTATTGAATTT 

CATTTCTTTTACTATATATTTTTTACTTTGAATACACCTCTCTCTTCCTC 

ACATTTT C C CC CATAGTATTTTGT CTTCAATTGACAGTTCTACTATCTTA 

TTACTCTGGAGATATTAATAATAGTTTTTAAATTTTTATTTATTTTTATT 

TTCAAAACAGTGTCTTACTCTGTCACTCAGGCTGGAGTGCAGTGGTGTGA 

TCATGGATCACTGCAGCCTTGATCTCTGAGCTCAAGCTATCCTCCTGCTT 

CAGCCTCCCAAGTAGCTGGAACCACAGGCATGTGTCACCATACCCAGCTA 

ATTTTTTTGTTTTTGAGGTGGAGTCTCACTCTGTAGCCCGGTCTGGAGTG 

CAGTGGTGCAATCTGGGCTCACAGCAACCTCTGCCTCCTGGGTCCTGGTT 

CAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGAAACA 

CACTACCATGCCCAGCTAATTTTTGTATTTTTGTAGAGACAGGTTTCACC 

ATGTTGGGCAGCCTGGGTCTGAACTCCTGACTTGTGATCTGCCCACTTGG 

GCTCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCACCCGGCCACT 

AATTTTTAAATTGTTAATAAAGACGAGGTCTTGCTATGTTGCCCAGTATG 

GTCTTGAACTCGTGGGCTTAAGTAATCTTCTGCCTCAGCCTCCCAAAGTG 

TTGGGATTACAGGTGT GAGCC ACTGAATCTGACATTTTTTAAAAGTTTTC 

TTCTCTTTACCAAGTCTTTTTTCCCCTTTCTGCTTTTTTGGGTTGTTTTA 

TTTTGATCTCTATCTTGCTAGAAACTTTCTGGAGACGTTTAGTAATACTA 

GATTTTTGAGAGTGGGCAACTGGAAAGCTGATTGGAAACTCTGAATACAT 

GGGTGAGGCTTGTTGGCTGTGAGTGTCATTGCTTGATGTCCTGGCAAGGC 

CAATGGGTTTGGGACCCCTACTATTAGTATAGGCCTGATTCCCTGGGAAA 

GGCTCTTTTGATCTCCTGCCTGGAGGATAAAGGCCTGGCTACCAGCCTTC 

TGTGTGTAATGTGAGGGAGAAGGGCTGGAGTATTCAACATCATGCTGAAT 

CCTTTCAATGATCATCTTGT T T TT AGTAATCTCCTACCTTAACTCTCTGT 

CTTCTGCTAGTATGGGAAAGATGACCTGAAAATCTAACCATTTATTTTTC 

CCCCATTAATATCATTTTATGATTATTCAGAAGTTAAATAATTGTCATGC 

TGTCCTCCAAAAAGACTGAATCAACTAGCAACAAATAAGAATTTTCTCAC 

AGCTCTGCCAGCATTTTAAAAGAATAGCTTTATTGAGCCCAGGAGGTCAA 

GGCTGCAGTGAGCTGTGATTACACCACTCTACCCCAGCCTGGGTGACAGA 

GCAAAACCCTGTCTCAAAAAAGAAATTTAAGGAACAGCTTTATTGTTGTA 

AAATAGACATACAATAAACAGAGCACATATTTAAATTGTGCAACTTATAC 

TTTG ATAT AAC CCTGTG AAAACAT CAC CAC AATCAAGATAGTG AAT ATAT 

TTATCACCTCCTGATACAGTTTAGCTCTGTGTCCCCACCTAAGTCTCATG 
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TTGAATTGTAATCCCCAATGCTGGGGGAGGGGCTTTGTGGGAGGTCSATTG 

AATTGTGGGGGTGCACTTCCCCCTTGCTGTTCTTGAGATAGTGAATGAGC 

TCTCATGAGCTCCCCTTCACTCACTCTCTTTCCTGCTGCCATGTGAGGAT 

GTGCTTGCCTCTTCTTTGCCCTTCTGCCATGATGTGTTTCCTGAGTCCTC 

CCTAACCATGCCTCCTGTACAGCTTGCAGAACTGTGAG7CAGTTAAATCT 

CTTTTCTTCATAAATTACCCAGTCTCAGGTGGCTCTTTATAGCAGTGTGA 

AAAGGAACTAATATACCTCCTAAGTTACCTCAAGCTTCTTCTTAATTCCT 

TCTCCTCCCTTCCTTCATTGCCAAGCAAACAACCACCTGTTTTCTGTCAC 

TATAGATTAGTTTACATTTTGTGGGTTTTTTTTTTTTTTGAGACAAGGTC 

TCACTCTGTTGCCCAGGATGGAGTGCAGTGGTGCGATCATAGCTCATTGC 

AGCCTTGAACTCCTAGTTTCAAGTGGTCCTCCCACTTCAGCCTCCTGAGT 

ACCTGGGACTACAGGGGTACACCACCACAACTGGCTTAAAAAATTTTTTA 

AATAAAAATGGGGTCTTGTTATGTTTCTCAGGCTGGTCTCGAACTCCTCG 

CCTCAAGCAGGCCTCCCTCCTTGGCCTCCCAAATTGTTGGGATTACAGGC 

ATGAGTCATGACTCCTGGCCTAGTTTACATTTTCTAGAGTTTTGTATAAA 

TGGAAACATACAGAATGTATTTTTTTGCGGAGTGGGGGAGTGTTTCTATT 

TCTTTCTTTCTTTTTTCTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTCG 

CTCTGTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACCG 

CAAGCTCCACCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGACTACAGGCGCCCGCCACCACACCTGGCTAATTTTTTTTGTA 

TTTTTGGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTCTCCjATCT 

CCTGACCTCGTGATCTGCCCGCTTCGGCCTCCCTAAGTGCTGGGATTACA 

GGCGTGAGCCACCGTGCCCGGCCCAAGTGTTTCTATTTCTTAACCAGCTT 

TCATGCAATCTTTTTTTATTTTACCATCTCTGTGATCCCACTCCCAAAGG 

TACTAGATGTCGATTGGTCCTTAGGATCAGCTACCATTTGCCCAACTGCT 

TTCCAGCCTTCCAAAAATTTTTTTCTTTTTTTCTTAAAGATACTCCTGTG 

TGAGGCTCAGAACTCTTGAATTGCTACTGCAAATATGAACTCGGTGATGT 

GAATGCCAGGGAATTGCCTGATTGATCAAAGAAATGTATCCCCTTCTCCC 

TCACTCTTGCTGTCTTCTCATTTGTTTTCCCCATCCTTGTGGATTCGTGA 

ATTTAAATATCCCTTTAATGTTATAATATTTTAATGGCGTTTGGCGAAAA 

GTACAGAATTAGGTGCAAGAGTGCATAGCTGTTATTTTTTTTTTGGCCTC 

TGAGACTGTTCATATATGCAAGTTATTTAACAGAAAGTTCTGCAGTGACC 

TGAGATGTCAGGGGGGTCTGATAGAGTACGTTTGAAGGCAGTTACTGGAA 

AAAAATAATGCCATTTCTGGTTTGTACTTCGGTAAGTTCAGATGACCCAA 

TATATTGTTTACATGTGGCATTCAGTAAAAAAGTAGCTTCCCCTCCCTTT 

CTTCTTCCTTTTCTCCTTTCCTGCTTCTATAAAGCATCTGCTTTGGGAAA 

CTTCTTAGGAGGAGAGCTTGCCAGCCCGTGGGTAATGGAGAGGTCTTGCA 

GAGATAAAAGAGATGCTCCCACTCAATGCAGGATGGTGTGGAGGTAAATG 

GGGATACGTCTGGCATCACTCAGGAATGGGCCTTCCTGGCAGGGAAGAGA 

AGGGAGGGGAAAGAGGAAGGGAGTCAAAGATGAATTGCTGAATACGGGGA 

TTCCAGGGCCTGGAGCCAGGAAGAGAACTTTGGGAGGTGTGAACCTGGAG 

GGCATCAGCTGATGAGGAGCAGCCTGAAGTCCGGGGAGGACCTGTTTTTG 

GTGGCCAGGAAGAAAGTGCCTTCCACACACAGGGAGGCCACAAGGCTGAT 

GGGCTGGGGGTTGGAAGGACAGCCCTAGGACAGGCTTGGGAAGCAGGCTC 

AGGTAGGGACTGCGAGGTTCTTGTTGAGTCTTTTTCATTCCTGGTCTTAG 

AAAATAGAATCGAAGGCCTCTTGAGAGTGGAAGGTGGGTTGGGAGGAGGG 

CAGATGGGGCTTAGGCCCAGGACACCCGTAGAGCTACTGCCCAGCTGTCT 

CTCAGGGACTCTGCTGAGGTCACTCCAAGGATCATTCTTAGCCTTGCTAG 

ACAGTACTGACAGAGGGAACCGTAGTATCGCACCCACTTCCTTCTCTTTC 

AATGAAAGTTTAAAGGTCACCATTTCCTCTGGCAAAGGAAGTTCCACAAA 

TATTCCATTTCCGGTCTTAGAAACAGCAAGGTATCAAGCAATTGCAAACT 

TCCTGTGCTGGGGAATTCCCAAGGAAGTAGGGGCAGAGTTCTGGTGGAGA 

CAAAGTGAATTCCGAGTGATTAGTCAGTAGCAGTAGCAGTAGCAGTAGCA 

GTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGTAGCAGC 

AGCAGAACCAGAATTTCCCCGCACGTGTCTCAGGCTCTCATTTGCCAACT 

CAGTCTCTAAGTATTTTTATTGGCAGGAAAAATAAAATAGCTATGAGTGA 

AATAATTCATTAGACCTGAGCCTCCATCAATTTTGTGTTTAAAGGCCTGA 

CTCTCTTTACCTTTCCCTGGGATGGAAGATGCAAATGTTCCTGATGTCAC 

TGTCAAAAAAGAAGAACCAGTGGGTATATTGTATGCTTGAGTTCCAGCCA 

TTTGTCACAATAGATAGAGATGACTGCCATGTGTGTAGACTTTCTATAGA 
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CTGTGTGCTAAACCCGACCTGCCACTTCCAAGGAGTAGATGAGGAATG * C 
CATGGTTCTGGGGAGCCCTACCCCAATTTGGGGCAGACATTCCAAAGCTC 
ATTTTCTGTGGAGGGGGTTGATGGTTAAAGGACGGCCTGGGAGTAACTCG 
TCTGTACTAGGGCCCAGGAGAGTTACATGCTGCTTCCCATGTTATTCATC 
ATTCCCCCATGTGAATAGCTATGGCGTGAGGTCCAAGGTTAGGGCCTTTC 
TACCATAAATGGGGGAATAAAATTCCCCTACCAGCCTGAGAAGTTTCTGT 

GTTATTTCCATACGGTTTGTTTTAAAATGTAGATGTCATATGCAGGAGAG 
GTGGTGTAGTGAGTCACAACGGGATTAGAAGGACCAGTCCGAAAAGCAGA 
AGAGGGTCAAGTTCAGGGCACTGAGGACTACTGCATTCAGTGGCGTGAAA 
GGCAGATGGCTGAACAGGAGGGGGACATTACATTGCTTGTTCTCCTTGAG 
CCTCGATTTCCTCATCTAAAAAGAGGGTCATTTATTCACAGAACATTTAT 
TAAACTTGTGCCAGGCACCGTGCCAGGAGCTGGACTAAAAATTAAATCCA 
CCCCTGTGAGCTGCTCTGAAGGCTAAAATATGAAGTATGTAAAAGTAACC 
AAGTGCTGTACACATGCAGCTATTCAATGACTGTGTGGGCATTGCGGCAG 
ATTTTAATTTTCTTTTTTATTTCTTTCTCTTTAGTGAGAGGTGTTGGTTG 

TCCAGCCCATTCCAGGGCTGTCATCTAAGACACTTCTTATCACCTAAATA 

ACCGGGGAGGCAAAGCGCTTTCTTAAGAGATGGATCCAGAAGAACAATGC 

TGGTTTTCTGTAGAAAAAGGGGCTGTGGGAAGTAGAGATAAGAAGGGAAT 

TGGCCAAGATGAATGTACAGAGCCTTATTTTTTTTTTATAACACAGCAAG 

ATTAGATACAAAACAGGACAATAGCATCATCTGTTTTTATAACTGGAAAG 

GACCTCACTTTACAGGTGGGGAAGAATAGAGTGGAGAAGTGAAGAGAATG 

GTCACAGAGTCAATCAGCATGTCTGCGTCAAAGCTGGGATTCCCAATTCA 

GGGCTCTTACTACAGTGACGTATGGCTAATATTTTGGCATTGTTTCGGGG 

AAAAGCTGAAGCCCTGATGGTGTACGTCACTCTTGAGATAGTCTGTAGTC 

CAGCAGGGAGGAAAGCAAGGAAGGGAGGTGGAGGCAGCATTTTTGGGTGT 

AACATTTCGTTCTTGTTTTGTGGCCAAATCATAGTGTGATTGGGACAAGC 

CACTGCCTTTCTCTGAGCCTCCACTTTCTTTTTCTTCTTAAGAGGGAGGG 

AATAGTAGAGTAAAAGTAGTCATTTTATCAAACACCTGCTATTTTGGAGC 

CATATTGCAAGTGGGTTGGGGGTTGAACACTTGGCTTTATTACCCATAGG 

ATTAAATCCAACCTCGATACTGTGGCATTCCCAAACTCCAGTCTAATCTT 

CTTCTCCATCAGCCATGCCCCACGACACCCTGGTCATATCTGATGTTGCC 

CCTTGCACTTGCCCCCTCCTTATCTTTGCTTTCTGACCTACCATATGGCT 

ATTGGTTGAAATTCTCATTTTCCAGGGCCTTGCTTAAATATCATCTCATC 

CATTAAAACTTTCTTGAACCTCCCCTTGCCCTGTTCCTCCCTAATGTCTC 

AAG C C AGAATTT ATTTC CTTTTGTGGC CAAGGGACTGGGTTTGTGAC CT C 

TCTCACGAGACTTAATATTGAGACCAAACGTCTTTAGACCTCACCAGCCA 

3AGAGATGAGCATCTATGGAATGCAGGCTTTTGCCTGGACTTGCTGATGC 

AGGGCCTCTGCCTTCCTCCAGGGCCTCTCCTGCTGTTTTAGGAATTTCCC 

TCATGGCACAGTCCATGAGCTCAGGGTCAAGTTCATACATGTTTTTACTT 

CTTCTACTCTGCAAATGGTCTTCTTGAACTCTGAGGGTCCTAAAGCTGCT 

CTGCAGTTTGTGGGGTGAGTAGAAAGGGGCTTTCAAAAGTTGTGCTGTTG 

TTT C C C AC C C CAAT AGCATGAAACACAAAGATG CTTACAAATAGCTGC CT 

TGCTTTCTAGTCCCAACTTCTCTCTCCTGAGGCTTTAAAACAAGTCCCCT 

AGGTTGAGCTGGACTGGAGTTGTATCCTATCTTCATTATCTGTCTACTCT 

CTTTCTGCTCTCTAGAGAAGATATTATATATGTGTGTATGTATGTGTAAA 

TATATAATATCCATATATAGAACATATATTGTTATATTTACATATACATA 

CATAACATATGCATGTATTCATATATACATATGTAGTATCAAAGTTGGAA 

TTAAACTGTATATTTTGTAATTTGCTTTTATTTGCATCTATCACTGTAAA 

TTTGAAACAGGGTCTTGCTTTGTTGCCCAGGCTGGAGTGCAATGACCCGA 
TCTTGGGTCACTGCAGCCTTGACCTCCCCGGCTCAAGTGATCTTCCCACC 
TTAGCCCTCTGAGTAGCTGGGACTAAAGGTGTGTGCCTCCACACCCAGCT 
TTTTAATTTTTTTTGTATTTTTTTTTTAAAGACAGGGTTTTGCCACATTG 
CCCAAGCTGGTCTTGAGCTCCTGGGTCCAAGCAATCCTCCCACTTTGGCC 
TCCCAAAGTGCTAAGATTACAAGCATGAGCCACCACACCTGGCCTCAATG 
TAATTTTTAATGGCTGTATAGTATTCCATCATGTGGTTGTACCCAAAATT 
ATTTAACCAGTCCCCAGTTTATTTCAATTTTTTTTTACTATTTTGAATAA 
TGTTTTAGTAAATACCCACAAAATATGTACAATGGCTGGGCTTAGTGGCT 
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CACCCCTGTAATCCCAAv^CTTTGGGAGTCTGAGGClA.GGTGGGT^cdt'G 
AGGTCAGGAGTTCGAGACCATCTTGGTTAACATGGTGAAACCCCGTCTCT 
AC C AAAAAT ACAAAAATT AG C C GGGTGTGGTGGCACACAC CTGTAAT C G C 
AGCTACTTGGGAGGCTGAAGTAGGAAAATCACTTGAACCTAGGAGGCGGA 
GGTTGCAGTGAGCCGAGATCACACTACTGTACTCCAGCATGGGCAACAGT 
GAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAGTACAATTTGTTG 
TACCTCCCTGATTATTTCTTTTAAGTAGAATTTTCTTATAATTTtTTTTA 
TAAGTAAAATTTTGAATCAAGGGAGAAGCACCTGGAGTCCTTCAGATACC 
TATTGCCAAACTGAACTTTTCTGTTCCAGGTTTACTACATTCAGCCTGAC 
TCAGGGTTTGGGGAGTAGAGGAGGGGGTGGAGGCAGAGGGCCTCTCCCTG 
TCCCCACAGACCTCCCTTGGTGAGGTCCAAGTCTGGACAGGTGGAGTGTG 
GCATTGCACCGTCAGGTCCTGCTTCCTGTAATTCCCCTAAATCCATCCAG 

CTGAAGACGGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGA 

TCTTGACTCATTTCAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCC 

TCAGCCTCCTGAGTAGCTGGCACTACAGGCGTGTACCATCACGCCCGGCT 

AATTTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAG 

GCTGGTCTCGAACTCCTAACCTTGTGATCTACCCGCCTCTGCCTCCCAAA 

GTGCTGGGCTTACAGGTGTGAGCCACCAGGCCTGGCCTCAAGTCTATTTT 

TTAACTCCAGGAGGCCTGGTATTCAGAGGGATTAGGGCTGGCAGAAGGGC 

CTCAAAGCTTTCAAGGCCTGGGGAATAGGCTGCAGCCTGGTTCAGGGTAA 

CCCAAGTGATTTTGGTTCCAAAGGGACAGGAAAAAAAGTGATTGATATGG 

AAGTTGTCAAAGTGCAACTGTCAAGACATTAAAAAATGTAACCCTTTTAC 

TAATATACAGTAGACTTGTGTTAAATATTTAACTGATTGTAAAAGGAAAA 

AACCAGACGCAGTTTTCCCTACCATACTGTCACAACACCTCAACACTGAG 

TTCTTCTGTGACCTCTAGTCACCGAAATGCTTGGGGATTTCTCCCACCAC 

TAGTCCTCCAGCAGCCGACACCAGTTGGGTGTCCTAATTCACTCCAACAC 

TATCTACCTGGAGTTAGCGTTAGATCCCACAGGTTGAGGGCTCAGTCTCA 

CAAGACTGCCTCCCACTTCAGGTGCCAGTTACAAGTGGTAGGTTGTCACC 

TATGCTTCTGACTGATGGCTATAAATCTGGGTTTGCTTCCCTCGGGTTCC 

GTGAATTTGCTAGAGCAGCTCACAGAACTCAGGAAAACACTTAAGTTTAC 

CAGTTTATTCTAAAAGATATTACAAAGGATACAGATGAACACCAGATGAA 

GAGATGCGCAGAGCAAAGCATGTGAGAAGGGGTGTGGAGCTTCCATGCCC 

CTCTGGGGCACCACCCTCCAGGAACCTTCATGTGTCCAGCTATCTGGGAG 

CCCTTCCAAACCCTGTCCTTTTTGGGTTTTTAAGAGTGGCTTTATTACAT 

ACACATGATTGACCGAACCATTGGCCATTGGTGACTGACACAACCTTCAG 

CCCCTCCACTCCCTCCAGTGGTTGGGGAGTGGGGCTAACAGTCTCAAGTC 

TCCAATCCTGCCTTGGTCTTTCCTGTGACAAACCCCATCATGAAGCTACT 

GCATTGGGGCTGCCAGCCAGCAGTCATCTATTAGCATGCAAAAGACACTC 

TTATTATTCCAGAGATTCCAAGGGTTTTTAAAAGCTGTATGTCAGGAAAC 

AGGAGATGAAGAACAAATATATATTTCACAACATCACACTCGTTGGGGGA 

ATTGACAGGATAGCAAAACTGATTAAAGGAGGATAGGAGAGACTGAGATA 

TATATTTCCATATATATATATAGAGAGAGAGAGAGATATTTCCATATATA 

TATATAGATCTAGAGAGAGAGAGAGATAGAGAGAGAAGAGTCTTTCC 

>Contig51 

ACACATTTGGGGGAGCAGTTCCGGAGGTACAGCCCGGACAGGAGATGTGA 
GAAGATCGTGGTTANTGTTCCCCTGGTCCAGAACCCCTCCAAGTGGGCTT 
AAGTAGGAAGGGTGGTGAGCGGCAGGTAAACACACGTCAAAGGCAGTCTT 
CCTCTCTGAGGGAAAACACTTGTATAAGCATTGCAATCAATGGGCCTCTT 
TAATTATGTGCCAGTGGCAAGAGCGGGTGCTGAACCCAGGGGCCTGCCTC 
AATCCGGGGCCTTTGAGGCAGAATAAAGTGGTCTCAGGTTGTTGGCATTT 
CCTTGCCCTTCCACCCGAAGCAGACACAAATCCTCTCTGGAGGCAAGTTC 
C CCAATT CAGC CAGTACAACTC CCACAGACT AAGATCAATCATGTACAAG 
CTCACAGACAAAGGTCACCAAACACACAGAGCAATAAACAAATTCATGAG 
TGACGTGAATGAGAATAAACAGAAACAATAACCACCAGCTGGGATGCTCT 
AAGTCTTCAGCTGTTAGAATTCCTGAATATAGAATAAAACTGCCACAATG 
GCAAACATGCATCTAGTACTTACTGTGTGCTGGGTTCTAAGAATTTTGCA 
CATTGTGCCAGATACCGACTCAGCTTCACACTCACCCTCCTACTGTGCCC 
TCTTAATTTGCACTAGATTAAAAGGTAGAAAGGAAGAGGCAGCTATTCTG 
TTCTTGGCTGTGCCTCTGGCAGCACATGCAAAATGGGCAGTAACAGTGGC 
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AGTCACAGGTAAGTAGC TTCTCACAGx o rGGAGTTAAAGGCATGGGAv. J ' 

GAGACGAGCAAGGTTCCTAAAGGGACAGTGGCCAGTAAATGACCAGGGGC 

TACTGGAGTGGCTGCATGGCTCTGTGGAAGCTCAGAGGAGCCTTGGGTCC 

TGCAGGTGCAGTAGCAGCTTTCTGTAGTTCCTGATCTCTGGGTCCCACAA 

TC7TCCCCGTTTTTGCTCCTCCACTTCTAATTTTGTAACTGACTTCCCTG 

TGTGTACTTCTCTCTCTGATTGAAATAQCCAGACTGGTTTCTGTTTCCTG 

ATAAGACATTGTCTGGTACGAACACAGTAACTCATTTAATCCGATATCTC 

TATGAAGGAGGTACAATAATTATTCCTATTTTACAGATGAGGAAACACAG 

CAGAAAAATAAAGTCAATTGTCTAAGGTTGCACATTTAGTCAAGGGAAGG 

GTTGATATAACATATAATTATTTAGAAAACATCTAAGGAAATAAAAGGCA 

TAATTTAAAAATAAAACTAGGCAGGTTTAAAAAAATGAAGTAATCTATAA 

GTAAAAAAGTATAATTGTTGAAATACATATCTTAGTGGATGGGTTAAATA 

GCTGAAGAAATGATTAATGAACTGGAAGGTAGTTCTGAGGAAATCAGAAT 

TCAGCATAGATAGAAAAAATGGGAATTTACAAAAGTACACAGGAATTATA 

AAAGAGGTTAAATTATAGGGAGGGTAGAATGAGAATTAACATTGGTCTAA 

CTGGAATTTTGGAAGAAGAGAATAGAGAGAATGAACAAGGCAATATTTAA 

AGAGGTGGCTGAGAATTTTTCAGAACCAACACAAACTATGACTTTACCAG 

TAGAGAAAACAATGTACACTGAGGAGGATAAATAAATATACTATGAACAA 

ATTGTAATAATAATACTCAACAAAGACAAAGAGAAGATCTTAAAATCAGC 

AAAAAAAGAAAGTCAGACTTAGAAAGAAATGACAATGGCAGACTACTCAA 

CAACAACAATGGAAACCAAATTCAGTGAAACAGTATTTTCAAAAT.GCATA 

TTTAATCTATCTTTGAAGAATAAGGGTGAAAAGGGTGAAAATTGCTGCCT 

TATACAAAATATCAACATTAACAAAAAGTAATGAAGGTAATATAAAAATG 

TTTTCAAATAAACAAAACTGAGAGAGTTTACCACCAACAAGCATTCATTA 

AATGGACTTTTAAATGCAGTTTTTAGGAAGAAGGAAAACAATTCCTAAGG 

AAGGTCTGAGATGCAAAAAGGAATTATGAACAAAGAAATTGTTAAAATTA 

TAGGTGAATTAAAAAAACTGCCTGCATAAATGATAATAATGACAATGATG 

CTATTAATAATGAGTTGATAAGGATAAAGAAAAGGACAGAATTAAAATAC 

TAGAAAACAAGCATGCTGGAAAGGATTCAGGAATTACTTGAAGGTTAAAG 

TTCTAGGGTCCTTCTATCCTTCTAGAGGGGAGTCAATATATTAATTTTTG 

ACCGTCACTTACACAGTGAAAAACTTTAAGGATAACCATAAAAAAATAGA 

AATAGAGAGTATAACTTCTGAAACAGTCAAGGGAAAAATATGGAATAAGA 

AAACTGACCAAAAAACATCTCAGTCAATCAAAAAAAAAAAAAAAAAGAAA 

GAAAAGGTTCGGAAGGAGAAAATCAAAGCATAGAAAAAGCGGGACAAATA 

GAAGTGGAAAAGAAAAAGGTAGAAGAAACAGGTCCAGAAATATCACTGAT 

GC ACT AAATCAC CATTAAAAGATGAAAAC AAATG AAC AACAT CAAAAAAT 

TCTAGTGACTGTAGTAGTGCTGATCAGAATAGGCTCTAAGATAAGATGCA 

TTATTGTGAGTCAACTTGTGATGATGAAAGGTTTAATTCACCAGAAAGAC 

ACAATTATAAACTTGTAATCAAATAGTTTTATTTTATTTACTTTATTTAT 

7TATTTTTTTTGAGACAGGATCTTGTTCTGTTGCTCAGGCTGGAGTGCAG 

TGGCTTGATCTCAGCTCACTGCAGCCTCCACCTCTTGAGGCTCAAGCTTT 

CTTCCTGCCTTAGCCTCATGAGTAGCTGGGTCCACAGGCACACACCACCA 

AGCCCTGCTAATTTTTGTATTTTTTGTAGAGATGGGGTTTCACCATGTTA 

CCAGGCTGGTCTCAAACTCCTGGGCTCAAGCGATCTGCCCCCCTCGGCTT 

CCCAAAGTGTTGGGATTATAGGCGTGAGCCACGGTGCCTGGCCTCAAATA 

ACTATTTAAGTGAAACAAAACTAGTATGGCACTAATGAAAAATGTATAAA 

TCCATAATCGCAGAGGGATTTCAACTTACTTCTTTCGATTATGTAAAGGT 

CAAACAG ACAAAAGACAATGACAAAACTTAATG CAATGAACACTTTTGAT 

TTAATGAACATATATTGGATATGTACCCAAGAATTAGAGAATACATACTA 

GTTTTGAGTTTATGCAGAACATTTACAAAAATTTAGTGGAAGCCTAAATT 

ATAAAAAGTTGCTGTCACGTAGAATAACACACAAACCCCTGAGTCCGGAA 

TTCAAAGCCCTCCACACTCTCCTCTACCTTTGCATCTTTATCCTCCACCA 

CACTGCAGTGCATACTCTGGGCTACTACTCACTGTTCTTGATTCAAATTC 

CATGTTCTGTCAGCTCAAATCATTCTCTCTGCCTGGAATAACTACTTCAT 

ACATATTCTGCTATTGAATTCTTGTCTTAGCACCCCATCTACTCCAAGAC 

GATGTCCAGTTGGGGTTACTCCCTGTCCCATTTTCTTTGATTACACTTTT 

TTTTTCTACTTCCATTATATTATTGATCACATCTGTGCCACAGTTTTTGA 

CTTTG7GTCTGCTTTTACTCTTTTCTAGACCCTGAGAGCTCCTGAAGGGT 

TGGGTCATTTCTTTTTTATTTGCTCATTCCTCATGGCACAGTGAGTGCTT 

AATAAATGGCTATTGACTGAAATTAAACTGTAT CTAAATGG ACAT ATT CC 
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ACTTCTGGGCCATTCATrCTTTCTTTCTATTGGAACCAGGAGATGteGGAA 

CCATAACAAAGGTAAGGTTGTGCCATGTGAAAGAACATGGAACCTTCCCC 

TGAGGG CC AAAAAAGAGCAGGGAAAGGTGCAAAGACAAAATCTTCCATTT 

TTAAACAATGTAAGAATGTGGTCCACCTCATGCTCAGGTGGGAC7TTATC 

ATGACGTTATTTTTGGGGACTTATAGCTGCATCATTTACCCCATATACAT 

TTACCTTTAGTGTAGGGAACTGAGGACAGGAATTTTGTTGATGCAGACTC 

TTGCTAATGAGGCTAACACTTGGAGAATTTTTATCATGCATTCAAGAAGC 

TTGTTTTACATTTCTTCATTAATACTTTAGTTGGTGGTTTAGCTTTAGTT 

GTAGGCTTATCAGATATTTGGAGATATCTTCATAAACGATGGCTTTGGTT 

TTAGAAGAGTTATTCTGAAGCTACTATTTCTGGCAATAATCAAACAGCAT 

GGCCATTTGTTTTGTAAGGCCTTTCCTAAAATATGACGGTAAAATCTACG 

TGTGGAAAAATGCTTATTCTTCTGTCCTCTATAAATGTGAATCTAGTTTG 

TCTTCAAAATGAAATCAAGTGATTAAAATGTAGTTTTCTAAGAAGATAAA 

TGGAGCAAAGCACTCTGTGTTTCACAGTGTTGGAAATCACTCATCCCTCA 

TAAAACTGTCCCAACTGATCCTGACTCACATGAATGAATTAAAATAAGAG 

TTAATAACATCAATTTACATTTTTAAAGACACTTTCCCATGT-TTTAGACT 

ATTGGTTGGAAAAGCTGGTAGGTGTACAATTTGTGGAGAGTTGGCTGTTT 

TTGTCTGTCGTTGTTTGACGTATTTCAAAGCCATATCTAATTTTGTTGCA 

GAATGGTCTGAATTCTACAAAAATGTTGAGTTGTGTAGTGTGGAGAAGTA 

CGGAGCCATTTACTGAAAGGGTGGGGGGAAATGACGAGACCCTGAGATAA 

GGCAGTAGTGGTGCGAACAGAGTGGAAGGGAGGTAGTTGAGATATGTTCA 

GAGTAGAATCAGAATGGACATAGTGAACAACTGGATGCAGGTGGGGGCTG 

AGGAAGCAAAGTTGAGGATAATTCTGAGACTTCTAGGTTGATCCACTGAA 

GTTACATTATTCAACACCACAAGGAAACTAGGGGAATGAGAAGGCATACT 

GGTTTGCTTTGGAGTGGAAGGGCAGTGATGTAAGAGGAGTTAATGAGTTA 

AAGTTTGGATATGCCTGAACTTCAATTTGATATGTGCATCTGATATACCC 

TTGGGGTGACCCTCCAGGCAATGGTTGAACATGTGTATTTCTTAGTAACT 

GATAGGCATCACAGACTCACATCAGTAAGGAAGCAACAGCAAACTTGATT 

GGACGATATACCTGGAACTCAGTACCCTATGACTGGAGCAAGTCTCTGTC 

AGTGAAATGAGGATAAGAAGAATCTTGACCTTGTGGAATATGTTGTTAGG 

AATATATGTGATGAACAACATAGGATACTTCCTACAGGGCTCCACATGTA 

GTAAGGGCTTTATAAATGCTTGATAAATATTATTGTTGTAATTTATTTCC 

AAAGTAAGATGCCACTGGAGGAATCTTTGGAACCCAAATTAATAACAAAT 

AGGACTGGATGCAATGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCC 

AAGGCAGGAGGATCTCTTGAGCCCAGAAATTCAAGACCAGCCTGGGTGAC 

ACAGGGAGACCTTGTATCTATGAAGAATTAAAAAAAATTAACCAGATGTG 

GTGGTGCACGCCTATAGTCCCTGCTGCTTGAGAGGCTGAGGTGGGAGGAT 

TGCTTGAGCCCATGAGGTTGAGGCTGCAGTGAGCCATAATTGTGCCACCA 

CACTCCAGACTGGGTGACAGAGTGAGACCCTATCTCAAATAAATAAATAA 

ATAAATAAATAAATAAGTACAAACCAGCAAACACTAATCCTTTCTAGAGA 

TTATTGAACTCTGGAGGGCAGATCTGAATGGAGCCAGCAGAGGGACCTAT 

GGAGATCAGCCTGGCCCTGGACAGCACCAGGCAATGGGGTTGCTAGAGAG 

GTAATGGGGTTGAACAGGGTTTAAGCCATGAGGTCTCAAGAATCCGTGAA 

GACTCAGACTAATTTTTTTTTTTTTGCATGAGGATTAGGTGTTCCTAGGA 

ATTTCAATGAGAGCAGGGTTAATGAAGGAATGCAGGGTAGGAGAGCTGAG 

GGAAGGCATCTGAGAGAGCCTGGCTTATGAATGGCTGCGTCAGTATGGCT 

CACCTGCTTTCCTTGTATCTACTTAGCAGATGATCCCACCCCAGGCCTCC 

AGGGCCAAGGTCATTTCCACATAGTCATGGGCCCTTGAGGGCCTGGAGCA 

GTGTAAGGAAGACAGAGTCTTAAGAAATTGCATTAACAGTCATGGTGCTT 

GGCAAGTGTCGTCATCCTATGCCAAGCCTGATCTGAAGGGGTGCATGCTC 

ATAGGTAGCTGCTGCCCAAGATTACAGCAGCTTCTTCAATCCCAGATCCA 

TGCTCTCCTATATTCATTTTTCCAGGGGTTCCTGTCCTTCGACAGTGATG 

AGATGCAGAATGACTTATTGAGTTATTCTCCTGATAGTTGCCAACTTTTC 

CAAATGACAATGGGGCATGGAGCTTGAGAGTGGAAATGAGGCCCTAGGGA 

TAGCGTGCTTAGGAAAACACTCCCAGCCTGATGTAATTCTGGGGGTACAA 

TGGCATTTTCATCATCAAGACTGATGTAAAGGGTGACTAGCAGTGAGTTG 

GGGGTGACTCGCACTGGGGCTAGGTTTCTGATTCTGCCTAATCCAGACAG 

AGCAGAAGCACTAGTGGGCTGGTAGAGGGCCTCCAGGGCCTCACTTAATG 

TCCTGGAAAAACAGCTCCAGATTGTTGGTTCACGTTCTGAGGACAAGCTT 

GGGTACTACAGGATAGAGAGAGTGGTGGGAGATGCCGTGGCCTGCCCTGC 
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TGATGCCTGCCCTGCCATTCCTGCGTGTGATGTCTCTGGGGCATCTTGCC 

TTCCCTGCCCAGACCTGTAGTTCAGCTGAGGGCATGTGGAGGCCAAATGG 

CTTCTTAGAGTGTTACTTTCCTTGAACAGCTCTGCTGGGAGAACTGGAGG 

AGCTAGCTAGTCACGGTAACTGCAGCAGTCAAAGGATCGTCCCGGTGGAG 

GTGGGGTGGAAAGGTAGAGAAAGAGAACATATAGCGTTTTCCTTGGAGAT 

GTGTGGGCATGTCATAGAGGAAATACCCAATTCCTGAGCCTTGAGCCCTC 

CAGGAAACCTTGGAATATTAGGTTAGTCATCCCCAAGGAAGTCTAAGAAT 

TCTGGTCTCACCCATCTCCTTTAATTCCCACAATGATCCTACATGATATT 

AAGGAACACGGGCCAGTAACCCTCCAAGCAATGGATGTGGTGGTGAAGTT 

TGACCTCATGATGGAGCGGAGGTTGGTTTGAAACCTAAGAATTTAATTTA 

TTGTTTCAAACTGTTCTCCACTCAGCGTTATTAAAGCATACATAATTGAC 

ACATAAAAATTGTATATGTCTACGGTGTACAATGTGATGTTTCGATCTAT 

GTATACATTGTGAAATGATTACAACAAGCTAAATAACATACCCATTCATC 

GTGTTTCAAAGGAATTAAACTCAAGCACAAAAGAGAGGTGCTGTTGAAGA 

GTAGGGCTGCTCTATCTAAGTAGTATGTCTGGGGTTGTCCTGGATCAGGG 

TCCTTTTGTGCTAGTAATAAACCAGCCCTTCTGGGGCTGCTC€ACTTTCC 

CCACATTTTCTTCTGGAGCCTCCCTAAGAATTAGGACATGGCCACTTTCT 

CTGCATAGGCTTCCTACTTCAACAAGGACAGGGCTTGTGCTGCCCCATGC 

CACTTGAGTGTCCCTACAGCACAGAGCTGAGTGCACACTGGCTGAGTGAG 

GAAATCCCCCAGATTAATCTTGGTTCTAAGCATCATGGCTGTATTTCACA 

CGTATATGAATTACAAATTACAGCATAGTCGAATAAGGATTTTTGTGCTA 

CAACTGGAATCCCAGATTATGCAAATTGGATAGTATAATATTGAAATTCC 

TAGGACTTTTTATTAGTTTTAAAAAATTATACAAGCTTAGAGTAAGAAAT 

TAAACAGTGCAAAAGAATTCACTGTGAAAAGTAAAATGCTCTGTCTCTGC 

TGAGAGACAGATATTGCAGCCCAGATACTACTGGGGTCAATAGTTTCCTT 

TAAGCATGCCATTTTGATGGTTTATGGGACTTACAGCTCAAGAAGCTTGA 

CACTAGGGTTGATCTCAGAAAATCATTGTTGCAGGTATTAGATATGACCG 

TCTCATAAAGATACACACACAGACACAGCGATTGGAGATATTCACTGGGG 

CTTATGGGCTGCTTGTCCTTTCTGCTCTGTGCCTAAGTTGGGCTCAGAGT 

AGCCTGGCATCGGCTGTGGGGAGAATGCTGGCATGGGCTTAGCAGGAGCC 

CACTTAACATGTCCTAAGCCACCTGGAAGAGTCCTTCAAGGAGACCAGAC 

TCCAGAGGCCCTAAGGAAGGAAGGACTTTTGCCCGTTTTTAGGTATTCTA 

GTCCCAGAGTTTAGGGAGGAATGGTTTGGCTTTGGGTCGTGTGCCCCTTT 

ACCGAGTGGGATGGGATGTGCCCATGAGCTGTTGAGCTGGCTCTTGGAGA 

AGACAGCAAAAGCGGGAATAAGAGGTCAGGAAGCTGTGTGGTTGTAGGAA 

ATCCCAGCAGAGGGCCTGGGGGTCAAAAGTGGTCATGGTAGTGACGGTGG 

AGGCTGAGGTGGTAGAAAATCAGAGGACAAACCCCATGGGCTGCTGGTGA 

TCTGACCGAGCTCCTATGCTCTCCTGGTTCATTTTAGGCTCTGTAGCAGC 

AGATGATTGGCTGGTGTGAGAGCAGTGCACCTGCCATATCAGGCAATCCA 

AGACAAGTCCAAGCTACGCTGGGAGGAAACCTGAAGGCAGCAGCAGGTAG 

ACTGGCTGAAGACAGACAGGCAGGCAACTTGTCAATCAGATTTGTGTTTT 

TAAGGACTTTTAACTGGGGAGCCCTCCATGACAGATCAGATGAGAGAGGA 

ATCTGGGTCCGCCCATGTGTCAAGCTACCAGAGGGTCCCATCGGTGCTTG 

GATCTTCTTTGAAGCTGGGTCTGAGGTTTGCAGGTAGAGGGTGAGCTGGT 

CAGAGGG AC CTATTG CAGAGCT AAC CAACAC CTT C C CAGGAATGCAAG CA 

CAAGCACCCCACCGCGGGCAGGCGGGCAGGCACTTCTCCTTTTGCCACCA 

GGACCTCACAGAGGCTGATCTGGCTCTGTGAGGTGGGAAAATGGGTTGTA 

CTTAGTACATAGAGATAAAAGGCTTAGGAGGCCCCTCCATCCTGTGACCC 

TGTCCCCAGACCACAGGTGCCGGCAGGTGCTGCTATTTCAAGGCTGGGCC 

TCAGTGCAAGCTTGTGGTTTCTTGCCCACCTGTGATGTCCTCCCACTAAT 

GAAGGGGCTCTCCATCCTCTGTCTGCCTCTAGCAAGTGGAGGCTCTGGGC 

C CTGGG C AAG ACACAGGGGG AAATGC CAT CTGTT AT C CAAAT AT ATTT CA 

ATGTGACAGGAAAGCTGTCTTTAGAGCACAGC 

:>Contig52 

GCATGTGCTCTACATTGATCCCAGGAGTTTGAGACAACATTGCAAGACTG 
GGCAACAAGCAAGACTCTGTCTCTACAAAAAATAAAAAAAATTAGTTGGG 
CATGGTGGTACATGCCTGTGGTCCCAGCTACTCCTAAGTTGAAGAGGGAG 
AATTGCTTGAGGCCAGGAGTTCAAGGCTGCAGTGAGCTATGATCACACCA 
CTGCACTCTANCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAATAATAA 
TCGTAATACATTTTTTTTAAAGTAAAACAAAAAAAGGTCACACTTTCTCA 
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TACCAAAATAAATTCCAAATAAATTAAAGGCTTAAACATGAGAAAGTTAA 

ACCATAAAATTACTAGAAGAAAATAAAAGCAAATATTTAGATAATCCTGG 

GGATAAATTTCTTTGGAATGAATTTCCTTAAGATGAATCTCTAAAAGTGA 

AATTCAGGG7TCAAAGGTCTTTTCTT7GTCCTTTTCTTTTCCCTTTCCCT 

CTCCCTTTTTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCT 

TTCTTTCTTTATCTTTCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTTCC 

TGGTTGCTTGCTTTCTTTCTTTCCTTCCTTCCTTCTTTCTCTCCCTTTCT 

TTCTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTT 

CTTTCTTTCTTTCTACTTTCTTTCTTCTCTTTCCTTCCTTCCATCTTTCT 

TTCTTTCTTTCTTTCTTTTCTTTCTCTCTTTCTCTCTCTCTCTCTCTCTT 

TCTTTTTTTTTCTGGTGAGACAGGGTCTCATTCTGTCACTCAGACTGGAG 

AACAGTCGCATGAACATGGCTCACAGCAGCCTTGACCTCCTGGGTTCAAG 

CAATTCTCCTGCCTCAGTCTCTCAAGTAGCTGAGACCACAGGCACCCACC 

ACCAAACCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCAC 

ATTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCTGCCTGCCTT 

GGCCTTCTGAAGTGCTGGGATTACAGGCTGGGCCTCTACGCGCGGCCGAG 

ACTACCTCTCTTTTAACTGGATCTCTGAGCTCTGGGCAGAGCCCACCCTG 

AATCCTGGTCTCCAAAAAGGGAAAATTATTAGGAGGCTAGACCATATGAT 

GCTTTTACAGTGCACTTAAAAAAAAGTTTGTTTTTTTTTTAAAAGACATT 

TCTACATGTCTAAACTACAATCTTCCTTGAAAACCCAAGAGTAGCTTCTG 

TTGCAATAGCTAGTCAAAAATATAATAGTCAAAAAAATCAGGTAACACAA 

CACAAACGCAAGCAGTTTAAGAGCTGAAATGAACTTGTCTGTTTACACTC 

TAGGGATTCCATAAGGAAAAATAGAAGTTTCTCCCTAAAAGGGAGCCTGG 

CACCTTCTCCATTTTCTTTAAGGAACCCCAGGCTATTATAAACTATTTTA 

GGGCTCTCATGCAGCAGACGGTGCAAGAGAAAGGAGAGACAGCAGAAGTA 

AATGAAGAAAACAGAATCCAGTCAACAGAGAAGAAAAAAACTTTTGCTCA 

AAAAAAGGCAAGTTCCTAGGAAAGAAAAAAAAAACATGAGGGCTATTTAA 

ATACAAAGACGCATACATACACATGCACACATCTTGGATGTTAGCTTTTA 

ATTAAGCTGACTTTTAACTATTGAGGTCCTTTAAAATAAATCTTTTAAAA 

TCTTATTACGATATTTCAGCTAGGACAAATTGCTGCTATTTCAGCATTAC 

C AAGTATC AAAC C AGAAAAGGCTTGATTTAGGAAC CAAAC C CAGGCTGTC 

GTGGTAGGAAAAAAGGCAGAACGTTAGCTATGGAACCCACAGCATGGGGC 

AACAGCCATTGCTCTTTCAGTATGGCCTGGCTAGCAAAAAGGTGGCCTTG 

TTATGTAAATAAAGCCCGTTTGGTGGTCAAAATGAAACATCTTTTCCTTT 

TTTTTTTTCTTTTGCTGGCCGTTTTTTCCCCCACCATACCACGTTTGTGT 

GTGTGGGAGGGTGGGAATTTAGCCACTTCAGAGGCCTCATTCCCCATAAT 

TTGGAAATTTCCTTTGGATTTGATCAAGTCAGATAGAGTAGGTCAAACCC 

AATGGGAAAAAGACTGAAACAGCAATAAAAACAGAAACAAACAGTTAAGC 

AAAATGAATGATCACACAACTTATATGATTACTGAGTGCTC7AATGGTAA 

GGAGAAATTAAGACCAGCTGGTTGTTAAACTTTAGCCAAGACAAAACCCC 

AATTCAGCTACTTACCTAGGGTTGGGTCTCAGGCTGAAGACCGCTCACTA 

CCGTTCTAGAAGCAAGAAATAAAACTTGAACTCGTCTTACCTGTGTAGCA 

GGACAAGCCGCAGACAAAATCCCTCAGACACCAAATTAAAGAAGGAAGGG 

CTTTATTGGGCCTGGAGCTGCGGCAAGACTCACGTCTCCAACAACCGAGC 

TCCCCGAGTGTGCAATTCCTGTCCCTTTTAAGGGCTCACAACTCTAAGGC 

GGTCCACATGAGAGAGTCGTGATAGATTGAGCAAGCAGGGGGTATGTGAC 

TGGGGGCTGCATGCACCTGTAGTTAGAATGGAACAGAACATGACAGGGAT 

CTTCACAGTGCTTTTCTTATGCAAATAACCGATTAGATCAGGGGTCGATC 

TTTACCAGGCCCAGGGTGTGTCACCGGGCTGTCTGCTTGTGGATTTCATT 

TCTGCCTTTTAGTTATTACTTCTTTCTTTGGAGGCAGAAATTGGGCATAA 

GACAATATGAGGGGTGGTCTCCTCTCTTACCTGCGGGGAGTGAGCTCAAA 

CTCCTTAAAGGAGTTACCTGCCTTCCATCATCAGGGAAGCAGGAAATCTT 

GCCTTCCTTGTTGGAAGCAAGTAAAACTCAAAACAAACAAAGAAAAAAAC 

AGGGAGTTGTACAGCAAAATAAACTTTTGATTTTGACCAAATTTTGGGAG 

ATCAGGAATTCTCTGAAGGAGATGCTTTCAGACCTCAGCAAATTGTCCTG 

^TGGTTTGAGCCATAAAGTTAGCTCATGCTGGTACCAAACACCAGTAGGA 

GAT rpT GTCAAAGGTAAGAGGCATCTCCACTCAGAATCCCTTCGTGGTTAC 

CAACA7G7GAACC77GGAAA7C7GAGACAGG7C7CAG77AA777AGAAAG 

^7A7^~~GCCACGG77GAGGACACCCACCCA7GACAGAGCA7CAGGAGG 

7CC7GACCACA7G7GC7CAGGG7GG7C7GAGCACAGC77GG7777ACACA 
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TTTTAGGGAGACATGAGACZATCAGTGAATATATGTAAGATGTACACTGGT 
r T C C C T C C AG AAAGG CAGAACAACTTG AAG CAGGGAGGGAG C TT C CAGGT C 

ACAGGTAGGTGAGAGACAAACAATTGCATTCTTCTGAGTGTCTGATTAGC 
CTTTCCAAAGGAGGCAATCAGATATGCATTTATCACAGTGAGCAGAGGGG 
TGACTTTGAATAGAATGGGAGGCAGGTTTGCCCTAAGCAGTTCCCAGCTT 
GACTTTTCCCTTTAGCTTAGTGATTTGGAGGCCCCAAGATTTATTTTCCT 
TCTACATCACTGTGGGCAGCTGACTAGGAAAGCTTTGTAGGACTGGTGGG 
CAGTGTGAGAGCCCAGTGGGGGGTGGTGGTCCTGTGCCAATGGTAGCAAC 
CACCTGTGAGGCTGAGTAAACTCATTTCCCAACCTCCTCTAGCAGCCCCA 
GTGGAGATACAGATGAAGCAGACTAGCGATACAACCCAGCCTGAAGTTTT 
GTCTGGTGAGTGTAATGGAATAAAAATGGGAAGGGTGCTGAAGAGACCAG 
CAAGAAAATGGTTGAAGAGATGGGGCACAGAAATTAAGCTGGATCAAAAA 
GGACGGAAAAGCAGAAAGGGCCGATAGAGAGAGGGGATATCTATGGGTTC 
GCGATTCTGAAAAGGACAAATCACTGGTGCTTTGAGAAGAGAGAGGGTGA 
GAAAGCAGGAAGGCTGGAGGCTGTCATCCAAGAGGCGGACATCTGTGAAC 
ATGATTCCAAGAGTCACCAGACCATGGGGGTGGCCAAAGGGAGTGCCTCT 
TCTCACCTCCTACTCTTAATTCCTTGTACTCAAGATAATAAGTTCCCAGA 
AGAGAAGTACCCATATTTAATTCATCTGTGTCTTCCTAGCAGTACTAAAA 
ATATTATATGAAAGGTATCAAACCTTTGAGAATGTGTGCTGCTAAATTGT 
TAAGGATGCTGGAAAACTCAAGACGTCCCTGATCCTGAGCCTGAGTATGA 
GCCTGTGGTGAGCCCAATGCAGGTCTCCATTCAGACAAAGGCCTCAGGGA 
ACGGATGAGACCTAGGGACAGAGATGCATGCTGGAGCAGCATTCCCCATC 
CCTACTGCAGCTCAGGCCAGCTGACTGCTTTATGAGTAAACGTTACCAGG 
GAACACTTTGCAGTCTTAACACACATGCCCACCTGTGACCACTGATCCCT 
GTTGGGTGACCACTGACATCAGAGATTCGATGGCAGCAATGAAGACAAGG 
CTATCCTCATTAGGAAGGAAAGGAAGGAGGAGGGAGGAGGGCAAACGAAT 
CTTTCCTGCTTGTCAACCACGTCCATCTCTGTTAGGTGATTTCCCATGTG 
TGACTTTGTTTATCTTTATAATAACTCTGAGAGGTAGGTCTTGATGTCCA 
CATTTTGAACATGAGGACATCCAGCCAGGAAGTTGAGTTCTGGGGACATA 
GCTGAGAGGGCAAAGCTACATATAAACCCCTCTTTGTTTTTTCTGGCTTA 
TCCACTGAGTGCCCCCTGCAATCCACCAGCCCATTTGTGAAGTGCATACT 
ATAGGTAAGTTGGCACAGGAGGAGTGGATGTGGGCGATTTTGTCACAGCT 
CTCCAGGAACTTACACACTGGTGAGGAGGGCCAGGTATGTTCCTGACCAG 
TCACAATCAAAGCAACCTCCTACTAATCAGGGAGGCTTGGTACCTGGGGA 
ATGCTATGTTGAAAGGTTCTTTTCTGGGTTTTAAAATGATGGGTCTATTT 
CCTTATTCTTAAGATTGCTTTTTTTCTGGCTAGAACTTAAAAGAAATTTT 
CAGTAAAATTTCCCTTCCCTGGCACAAAGTGAGCTTGAAATGAATTCCCA 
GGTGGCCTTGATACTTTAAAATATTGCCTCCTATAAAATCAACCTTTAGA 
AGAAGGAAGTCAAAGAACATGCTAGATTTCACAAAGGTTAATTCCTTGAA 
ATCCAGTTATCTACAGGACAATGTTGTCAAAGAAAAAATTATTTGGCCAG 
GCACGGCGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCAGG 
TGGAT CAC CTGAGGTCAGGAGTT C GAG AC CAGC CTGGCCAACATGGTGAA 
ACCCCATCTCTACTAAAAATACAAAAAAAATTAGCCAGGTGTGGTGGTGG 
GCACCTGTAATCCCAGCTACACGGGAGGCTGAGGCAGGAGAATCGCTTGA 
ACCCGGGAGGAGGAAGTTGCAGTGAGCCAAGTTCAAGCCACTGCACCCCA 
GCCTGGGCAACAGAGCAAGACTTTGTCTCCAAAAAAAAAAAAAATTCAAT 
G ATAT T T TT AAATT CATGGTAAGGAAGATTT CATT CAG AAC CAGCACAGA 
AGATATAGGAAACACTGCAATGGGACTTTGCGGTGGGGGAGAGAGATTGA 
ACACAAC T ACAT AT ACAG CACGGGCAAGGACAT ATT CAT AG C CAGGAAGC 
AGAGCAAAGATCAGTGGATGCGAAATTACTAAGAGGAAACATGAAAAATA 
AGGGAGCTTCTGCCTAAACCCACCTAACCGGATCCTTGCTGAAGACAGGA 
CAGGGTGATTGGACACCACTTTGGGGATGGTGGAGGATGGGGAATCCAGT 
GAGATTTCAAGGGTGATCAGATATTGAACATAGAAGGTTCTTGCTAAAAA 
AGGAGTTTACAAGAAAGTGTACAAATGTGCCTGGGAGAAGGTTCAGGAGC 
CTGACTAAAATTTGGTCAAGCAGAGAATATTTGCCAAGATAATAGCTAAG 
TCTTCTGACAAACAATAGATGCTAAGCCAGCAAGGGTGATGTGCTCAGAG 
AAAGCACTGAGGGCTTATTTCCTTTTCCCCCAATCTCCACTCAGTCAAGT 
CTAGTCCCCTTGTCAATGTAGCCATTTGTAAGAATGCAATCAGGCAGGGT 
CCCATCTCCTAGTGACAGGACTGACTGAAGTTCTGCTGAAGAGAGTGGCC 
TGGGGCTGACACCGAGATTTCAGAGTCCTGGGTTTCGCCGAGAGCTCAGT 

FIG, 4 (44 of 61) 

-SDOCID: <WO 9905426 A 1_l_> 



WO 99/06426 



PCT/US98/16102 



GTAGTGCCATGCCCTCTCTCCACCTGAACGCCCAGTGTGGGCAGGAACAA 

CTGCAGCTAGAAGTCTGGCACTTACGCTGGGGTCTAAGACCTGCCTGATC 

TGCTAACTAGTCTTGTCCCTTGGCTATAAACTGACGTTGGCACCTGGCCA 

GAAAGATGAGCAAGAGATCTCTGACACACCTTTAAGTCCCTGTGGAGTAG 

GATTATGTTGGGGAAGGTCATTCTCTTGACTGAGCAGCAATTTCAGAAGG 

AAGTCCCATGCCGAAGTGAGAGAAGGCAGGGAATCCTGCCfAGTCAGCTA 

GASCAAAACAGTCTGCAGGACGGGACCCAGGGATGTGATCCTCCCATCCA 

AAGGCACTGAACTAAATGACTAAAATACTTTCCAGGGCTCACGTTCTTTG 

AAGAATGGGGACTAAAACTAAGACAGGAGCCAGCAAGTGAGGACTTGGAA 

GGAGATGGCTCATCTGATCAGCCTCCACTCAACAATTTTAATCATCCACA 

CTGGCATGGGGACACAATATGAATAAGTTGACAGGGACCTACTCTGATTA 

AGCAGTGGGCTAGTGCAGAGACCTGTCAGTCAAGAGTGGACAGGAGATGA 

TTTCAGACAGTGAGAACAAAATTAACAGAGTCATGTGCTAAAGGGTGGCT 

GGAACTACAGAGGAGTTTAAGACTCAAGAGGTCTGGCTGGGCGCGGTGGC 

TCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACAA 

GGTGAGGAGATCAAGACCATCCTGGCTAACGCAGTGAAACCGCATCTCTA 

CTAAAAATACAAAATATTAGCCAGGCGTGGTGGCGGGCACCTGTAGTCCC 

AGCTACTCGGGAGGCTGAGGCAAGAGAATGGCGTGAACCCGGGAGGCAGA 

GCTTGCAGTGAGCCAAGATTGCGCCACTGCCCTCCAGCCTGGGCGACAGA 

GCGAGACTCCGTCTCAAAAAAAAAAAAAAGACTTGAGGGAGTTGTTTATT 

TTTGTTTTCTTTTTAAGACAGGGTCTTTGTTGGGCGCGGTAGCTCACGCC 

TGTAGTCCCAGCACTTTGGAAGGCTGAGGTGGAAAGATCTCTTGAGCCCA 

GGAGTTTGAGGCCACTCTGGGCAACATAGCAAGACACCGTCTC7ACAAAA 

AATGTGCAGGTTGAGGCTGCAGTGAGCAGAAAAACACCGCTGCACTCTAG 

CCTGGATGACAGAGCGAGACCCTGTCTCGGAAAAAAAAAGAAAAAAGACA 

GGGTCTCGCTGTGTCACACAGGCTGGAATGCAATGGTGCAATCATGGTTC 

ACTACAGCCTGGAACTCCTGAGCTCAAGCAATTCTCCTACCTTGGCCTAC 

CAAAGTTCTAGGACTACAGGTGTGAGCCACCACACGTGGCCTCAGGAGAG 

ATCTTAATAATAAAAGGACAAATTGCCTTGCATCCCTTAGGGGCAGGATT 

GACACATCCAAGGATCAGGCAGAAAGCCTGTGCGGAGTGGGATGAGCAAA 

GAGAAAGGCTGAGAGTTGTGAAGAGGGAGATGCAGTGCCAGCTAGGACAG 

GCCTTTTTGGGCTATGGGAGGTTTTCAGAGGAGACCCCACCTAAACTAAC 

CCATAACATTGCAGTGGGGACCTGTTGAAGTCATGGACTACTACCTGAAA 

GCCAGAGAAATGGGAGGAGCCTTTCCTCTGAGGAGGGACTCTAGTCCATA 

GGTATCTTGCCACCAAATACATGGACAGGCCCTGGGGGAAGATGGTGGTA 

GCCCAGCTGGAGGAAAACCATTTGCCACCTGAACTAGCCCAGGGTAAGCC 

ACCCAGGCACTGAGGGTGCACACCCATGCATGCACACACAGAATCACACT 

CCTTCCTATTATTCCTCAATTCAGGGGTCTCAACACCCATTTTTTTTGTT 

TTTGGGGTTTTTTTTACATGTTTACATTTTATTTATTTATTATTTTGTGA 

CAGGGTCCCACTCTGTTGCCCAGGCTGGAGCACAGTGCAGTCGTGCAATC 

ATATTAGATTGGTGCAAAAGTAATCACGGTTTTTGTCATTAAAAGTTTTG 

CCATTACTTTTAATGATAAAAACCACGATTACTTTTGACGCAACTTAAAA 

GCTCACTGCAGCCTCAAAATTCCTGGTCTCAGGGAATCCTCCTGCCTCAG 

CTTCCTGAATAGCTGGGACTACAGGCACATGCAATCCTACCTGGCTAATT 

TTTTAAAAATTTTTTTTGTAAAGATAGAAAGTCATTTTGTTGTCCAGGCT 

GGTTTCAAACTCTTGTCTTTGTGCCTCCCTCTGCCCTGTGCAAGACCTTC 

TGGATGCCCACTAATGAAGACTTCCAGGGAGAGGAAAAGTAAACATAGGT 

C C CTG AT CAAGGGAC CAGGGTTT AT CGAC CACAAACAGCATG C C CAGATT 

CCACTGGCAGTCCTAGAGGTCGCATTTGCCCCAAGTGTGTGTGGAAGGCC 

TCTCCCTAGCAGTTGGTTTATACACCAGCCACAGCACAGCATATTCTCTT 

AAATTGTGAACATTTGCAAAAACTCCTTGAGGACAACTATCATGTCTTGT 

GTACTTTTGTTTTGTTTCCCTTCCCCTATGTACACGCGCGCGCATGCACT 

CATGCACGCACGCGCGCGCGCACACACACACACACACCCCTCAAACTGAA 

TGCCTGGTGTGCTGAATGGATGAATGGCTAATGTAAGTCATTCTAAAAGC 

TACTTTCTTTGGCATACCATCACCTTTGATTTCATCTTTCTGGAACTCCT 

ATGTTCCCAGATGAATTTGGAAAGCCCTCAGGAAACATTTCAAAATTGCT 

ATATGGGAGAAATGGGAGGGTCTCTCTAGAAATTTACCTGCCACAGGTAT 

TTCTGGTAAGACACAGCAAAGGTGGCACCACCCATTCCTCGTTACAATGT 

CAATGCCAGTCACCTTCCTGTCCCATAAAACTTTATTAAAGGTGCAGAAT 

TCCCATGGAAGCAGGTGGACACCATCTGCTTCCAGCCAGCCAGGGGAGCA 
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AGGTGTCCACTGTGCCTTTGTGGCAGGAACTGCGCTTCTCTACTCTCCCA 
\ CTTTGAGGCCTCTGGGGCTGGCCTGCTGCCTCCTCATTGACAAGGCTGCT 
TACTGAGCAGTTCATTCTGAGCTGGACATAGTGCTTCTGGTGAGTCTCTA 
CT7CTATTTAACCCAAAGATATTCTTTCCTAAGGAAACGCTTTCCTGTCG 
GGGGAGGTTAGCTCCAGATGGAAGTCACAAGTGATGGCATGGTAGCTCTC 
ATCCGTTTGGGTGGATGATATTCACGGAGCACCACCATGAGCCAGTCATG 
GAGGTGAACAGTATATGCCAGCCCTGAATCAGGTGCATTGACAGCAAGGG 
AGACAAGCAAACAAAGCTGAGGTTTGCTGAGGATGTTCAAGACTCACACA 
GCACAGAGGAGCATCCACCACCCAGCTTGGGAAAGGACTTGTTATAGAGG 
GGGTGAAGCATGAGCTGAGTCTTGAAAGACTAGAAATTAGCCAAACTACA 
AGGAGGAGAAGGAGTTTCCAGTCAGGAAGAACAGGTTATGCAAAAGCACA 
GAGACTAGAAAGAATATCACATTCAAGGAACTGCAAATAGACAGGAAAGA 
TTGATGCGTGGGATAGGAGAGGAGGGCAGGGGATTCCAGGTGGGCCCTGC 
TTGCCACACTCAGGAGCTTGAACTTATCCACAAAGGAGGTGTGGAACCAG 
TAATGAATGGGTTTTGTGCAAGGGCTTCATGTCACCAGATTTGCTTTTTG 
GAGATACTTCTGTGGCTGATATGTGAGGAAGGGATGGAGGAAGTTTCCGT 
GGCAATCAGGAAAACCAATTAGCAGATGATTCAAATGGCCTAGGGGAAAA 
GGGAGGAGGACTTGGACTACCATGCAGCAGCAGAAATGGAGAGAAATAAC 
AGATCCCAGGCACTCAGGAAGCGCTCAGAATGAGCCCTTCAAAGAACTTA 
TGGTAGGTGATGGATGGATGGAGTGTGAGTCCTGGGATAGCATTGCCTGG 
GAAAATACTTTCTAGTTGAGACAGGGAAGTGGGCCAGCAGAAATGGAGGG 
CTTCTTCTTTTTGCTTTAAATACTTTTATAATATTTGGAACTTTGAAAAT 
G AGC AG ATAT ATT AGC AAAAAG C CT AAAAGGGATATTTTTGAAAT C ACTG 
GTAGTTCTAACATATAACTTTCAGCTTGCACACATCATCAATTAACTTTG 
ATAGCGCCTTTCTGAAACTATCATCCCAAATAGCAATCCTTGTAAAAACC 
TATTTTGAAAAACGGGCCTTGTAGGATAGCCTCACAGATGTTTTGTGGTA 
GATTTTCTAACATTCTAATGTCAGGGAGTGAAAGGAATCCCGTTAGAAGT 
TGGAAAATTCTGGAATCTCTATTCATGGTATTAAAGTTTTGCCGTCACAC 
AAAAGTTTAACACCTTTACACAATCAGACTTCCTCATTTTACATTGCTCG 
GTAATTAGAGGAAATCAGTCACCCAGAGCCTGGGTCCTAGACTTGACAAA 
ATGCACCCAACAAATCCTGAGTGGCCTTGCTGAGGACTTCTCCCAGAAGA 
TAGAAAACTCAGTTCCAGCCAACAAGGGGGAAGCAGCTGAAGAAGTGAAA 
TTAACAAAGTCCTGGAAGGAAATGACCAAATCATCTTTGATTGTGTAATA 
ACCAGAGAGTAGAATACAGCTACGACAGACATTTTGGGAGAGAAGCATTT 
TATCATAGCTTTTAGAAGAGAATATTTTTCAGCATCATAAGCACACAATT 
CCAAGACAGATACTTTCAAGGGATTGTTTTGACG 
>Contig53 

ATGTTNNGGTTTTGGGACCCCATTCAAACTTCATGTTGAATTTTAATCTT 

CAATGTTGAGCGAGGTCCTGTGGGAGGGTGATTGGATCATGGGGGTGGGT 

TCTCCCTTGCTGTTCTCAATGATAGTGAGTGAGTTCTCACAAGACCTGGT 

TATTTGAAAGTGTGTAGCACCTCTCCCCTTCATTCTCTCACTCGTCACTG 

CTCCGCCATAGTAAGATGTGTGTGTTTCCCCTTTGCCTTCCGCCATGATT 

GTAAGTTTCCTGAAGCCTCCCAGCTATGCTTCCTGTACAGCCTGTAGAAC 

TGTGAATCAGTTAGACCTCTTTTCTTCATAAATTACCCAGTCTCAGGTCA 

TTCTTTATAGCAGTGTGAGAGTGGATGAATATAGTGCCATATGTTTGTAT 

TCCCAGCTACCCAGGAGGCTGAGGTAAGAGGATTGCTTGAGCCTGGGAGT 

TTAAGGCTGCAGTGAGCCATGACTGTACCACTGCTCTCCAGCCTGGGTGA 

CAGCGAGACCTTGTTTCCAAAAAAAAAAAACCCAAACTGTGTAAAATGTG 

TTCATAAAAGTGTCTTGCTCCCACACCTGTCCCTATATATCTTATTCCTC 

AGCCTCCGACAACTACTTTATTCATTTCTTATGTATCTTCCAGAATCAAA 

AAAAAAAAATCAAATACAAGCACAGTGGAATGTATTGCCCTTCTTCCCCT 

CCCTTTTGTTACATCAGAGTTAGCATATCATAAATACGGTCTGCATTTTC 

TTCTTTTTCAGCTATCAGCATGTTTTGGAGAGGATTTCATATTCGTGCAG 

ACAGCATGTATTAGTCAGTCCTTGCATTGCTATAAGGAAATACCTGAGAC 

TGCATAATTTATAAAGAAAAGAGGTTTAATTGGCTCACAGCTTCGCAGGC 

TGTTCCACAGGAAGCATGGCAGCATCTGCTTCTGGGGAGGCCTTAGGAAG 

CTTTTACTCATGCAGAAGACAAAGCGGGAGTGGATGTCTTATATGGCAGG 

AGCAGGACTGAGAGAGAGAGAGAGAGAGAGAAAGGATGCCACATACTTTT 

AAACAACCAGATCTTGTGGGAACTCTGTCACGAGAACAGCACCAAAGGGA 

T AGTG C T AAAC C ATT CAT AAG AACT C CAC C C C C ATG AT C CAATCAC C C CA 
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CACCAGGCCCCACCTCCAACATCGGGGATTACAATTTGACATGAGATTTG 

GGCTGGGACACAGAACCAAACAATACCAGAGTGCTTTCTCATTCTTTTCT 

ATAGCTGCCTAGTATTCTATGTCCTTTACTTCATTTAGGCAGTCTCTTGT 

TGATAGACACTTGGGTTACTTCCAATTTTTCCTATTACAAATGATGTGCA 

ATGAATAATTTTGATCATTTTCCATTTCACATGGGTTATGTCCATCTGTG 

GGATAAATCTCCAGGAGTGAAATTGCTGGATCAAAGGGGAAGTGCACTTG 

TGATTTTCATAGTTAGCAAATTTTGTTCTATAAGGGTCATATCAATTTAT 

AGTCCCACGCGTAATATTTAACAGTGGGGATTTCCCGACAGTTTGACCAA 

CAAGGTCTGTTGTTAAACTTTTGATTTTTGTCAATCTGATGGGAAAATAC 

TAGTATCTCAAAGTGCTTTTAATTTGACTTTCTTATTACAATGTTAAGCA 

TCATTTTACTCTGCCCAAGATCAAATAGTATTTTCTTTTCTGTGAACAGA 

CTGTTAAGATCCCTTGCCTCTTGTTTTGCTGGATTTTTGTTCTTTTTTTT 

CAAATGTTTTGAGGCAGTTCTTTACATGTGAAACAAGTTATCTCTTTATC 

TGGGGTGTGAGTTACAACTACTTTTCCTCTGGCTTGTTTTGCGCTTTGAC 

TTTGCTTCTGGTGATTCCCGCAATTCTGAAAGTGTACTTTTTGCATCATT 

CATTCTTATACACCCATGCTCTTGTTCACGCTGGTTCCTCTA€CTGAGGG 

CTTTTTCTTTTCTTTTCTATCTGGGAACATTTTTTAGAGACAGGGTCTCA 

CTCTGTCATCCACGCTGGAGTGCAATGGTGCGATCACAGCTCACTGCAGT 

CTTGAACTTCTGGGCTCAAGCAATCCTCCAGTGTCAGCTTCCCAAGTAGC 

TAGGACTACAGGTGCATGCCAGCATGCCTGGCTGATTGTTTTATTTATTT 

ATTTATTTTTTGTAGAGATGGGAGTCTCACTATGTTGCCCAGGCTGGTCT 

TGAACTCCTGGGCTCAAGCGATCTTTCTGCCCCTGCCACCCAAAGTGCTG 

GGATTACAGGCGTAAGCCACCATGCCCAGCCCATGTGTGGAAATCTTCTG 

TTTATCCCTTTAGGCTTGATTCTTATGTCGTTCTCCTCCCTCCTTCCTGG 

CTACTCCTCTTGTTCTTTATCTTACTCTACTTGTCATGTTACCTTGTTTC 

TGCTTATAACTAGCTGCCTCTCCTATCTGAGGAGGGACTTGTGACTGTTC 

TCATCTCTGTACTCCCAGGTCCTAGTACATAGCGCTTGCTCAACAGATGT 

TTGGTGCATTGATAGATAAATCAATGGTAGCTGTTAATACCAGTCCTGAC 

TCCCTGCAGTGCTTCAGCTGATCCTGTTCCAGATGTGCACTGAATATCTT 

TCTGTTGAACAACAGAAATAAAGGGGATGGGTGAGGAGGATAGTCTTCGG 

TGGCCAAGGATATTTGTAGGTACTTTGCAGCACTCAGCAATGAGGAGTGG 

GCTTTAGTCCCCCAAGAACTCTCACAGCCCTGTTTGTCTTTACTGTTCAG 

TGTCAAATCCAAGACAAGTCAATGATCAGGAAAGACCTTTTTTTTTCTTC 

AGTGAAGTTTATTTCAGAACCATTGAACAGTATGATATTTGCTCATTTAT 

AAATATTCCCATTTAAATAATCTGAGCTTATATATTTTCAGTCTTAATTA 

AAGGACTTGATTTAAAGAGAGCACACCAGTCCAAATTGAATTGATTCCAT 

AGCTATTAAAAACTAGGCTCTTTTACAGACACTGCTACTTCTTGCCCCCT 

TTGAATAAATTAGACCAATGAATAAAACAAACAAACAAATAAATAAATAA 

ATAGGGAAGCGGTTGCTCATCAGAATGTGGGAGCGAATGACAGAGGGTTT 

CTTAGAACCAAATGTGGCCGTGGTTTCTGTCAGGCGGGCTTTAAGTGAGT 

AGGAGAGGTGAGAGAGGCCTGGCTCAACAAAAGGGCTGGGGATTGGCCCT 

GAAAGGAGAGAGCTGACTGTCCTGGCTGATGGACAGGAGATCCTCTTAGC 

ACTACCCTAAGGCAGGCAGTTGGGCATTGGTGTAGACAACAGGAAAGTCC 

AGGCTATAGCCGTACTCAAAAACCTTTCTGTTCCCTTTCTGCCAGCCCTA 

GGGATTGAGTCCACATTCAGCACAGGACTCTCTGGGTACAGCTCTCTTTA 

GGAAGACACAAATTGCATGGTGAAGTCAGTTATATCCTGGCCGCCTTTGG 

TCCCTCCCAGGAAGACGGGCATGTTTTCTGCTTGAGAGGTGCTGATGTAC 

CAGTTGGGGAACTGGGCAGACTCAAATTCCAGCTTGTTATTGATTTCTAT 

CTTGTTGAAGACAAATCGCTTTTCCATCTTCTTCTTTGGGTAATTTTTGG 

GATCTACACTCTGCAGCGAAAGAGAAAGAAGAATTTTTGTGGGGCAAGGG 

ACAAAAATGCTATGGGAAAGATGTTCTTTGGGTTGGCCAGAAAGGAAACT 

GACGAGCAGGTCACATGATCAGGAGCCACACTCCTGAGTTGTAACTGGGC 

CCCCAACTTTCTGTGTGATTATTAAAAGAGCCCTTCTTCTTTTCTAAAAC 

GGGGGGGTGAAAATTAAGCTAGAGCTTCTTGAAGTACCTAGTTTCCAGGG 
G CTTTTTATTGTATTTTTC CTTATGGT C CTAGAATGACAT CAACTTGGAA 
ATGAAGCTTTTGCTGAGAAAGCTGGAGGTGATAGTGGTGGTGATTTTGGG 
AGTGGAGTGGACGTGATAATGGGACCCTTTAAGTCATCTATTTCCCAAGG 
TGTCTATCAAATGAGAGCAGCCCTAACAATATATAATCTGTTGGGGTTGT 
AACTATGGTAGGACATAATAACATCGGCAAAATGATTTAATTTTCTGCAG 
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CAGGATTGAAGGTTGCA^GCAGTTAAAA~ fTATGTTAAATTTATTTACAT " 

TAATGCAAAATTGTCAAATAGACCTGTTCCCAGCTTTTCCTAGGGATGGG 

GGCGGGGAGAAGGTGGTTGTCTGGGAATAAGTGGTAGCAGGAGGCTGAGA 

AGGGCTTCATTCCATAGCATTCACTTACCTCCAGCTGTAGAGTGGGCTTA 

TCATCTTTCAACACGCAGGACAGGTACAGATTCTTTTCCTTGAGGCCCAA 

GGCCACAGG7ATTTTGTCATTACTTTCTTCTCCTTGTACAAAGGACATGG 

AG£ACACCACTGAAGAAAGAAGGGGGTCTTGTGGTTAGGGACACAGCAGT 

GCAGGGTCACCCCAACCCCTAGGCCCCATGAGTAGGATACATGTAATTTG 

GTAGCCTCTGTGGGAACCCACAGTGAGGTTCCTTGGCCTAAGACACAGGA 

TAACTTGACTTCTCACAGACAATAGCAGGGTCATTTTGTTGATTTAGGGT 

TTCCCCTCAAAGGCCTGAGGGTTTCTCAGAGCCTCATAGCAGTAGGAACG 

GAGAATGAAAGAGGGTCTACATTTTAAATGCTGAAGGAAGGAAGGAAGGA 

AGCCATTGTGTCACTGGCTGGCAATGTGCCCATCCACAGGAGCGGAACAA 

CTTGATCAATGTGGAAGGAAAGGAAAGAGGTGAGGCTGTACTTCTGCCAG 

AAATCAGGCACCAGAACTGTTTCAGGAACAGAGAGTAGCCCATGGGAAGA 

AACTGGGAGAGGAGAGGCTGAGCTGGGAAAGTGGCTCCAAAGAGAGACAC 

TCATTTTGATCTTCCTCAGTCACAGCAGTGTCAATTGGAAGGCCCTGGGA 

TCACTCTTACTACCCGATTCCAAAGAAACAGGATTTTCTTGGCCTGGCTG 

AGAGCAAATAGCTTCCCCCTTGAGTGAGGCTGTCCTTCAAAGTCAGCAGC 

CTTAGTTGCCCACACTCCTGTGCAGAGGCTTTGGCTACTGTGGCACGATG 

CCAGGCAGATCACCACAGCTAATGATGGGTTCACCGCACTTGAAACTTTT 

GCCCGTTACAGCGGAGAGATATAAGTTCCTGCTGGGCGGTAAAATTTCCC 

TACAAGGAACCACCTGGCATTGGGTGGGACGGATGTTGGGGCAAGGGGGG 

AAGACTGGGGAGGGGGATGGACACATTATCGCTCCAGCACTCTTGTTTCA 

GCCTCAACAACAGGAAGAGAGAACCCACAGGCAGTTAGGCCATGTCCATC 

AAATGACCCCATATTGTGGAAGAATTGACATTGCACTATGCCCAAGAGAC 

TTGGGTGGACATGGTCCTGGGAGTGCTTGAGCCGTCTAATTTCTCAGGGT 

CACACTCCTGTTAACAAATGCACTGGCCAGTGCAATCAAATGTGCCATTT 

CTAGGACCAAAGTTTGTATATTCCTTTTTAATATTTTTTTTCACTTGTGT 

TGATCATTTGCCTTAAATTAACTTTCTACTTTGTTTAAAACATGGAGAAT 

TAGCAAGCTGCCAGGAAGCCAGGCAGGGAAACCAGGATGTTTCCATTTAC 

CTTGTTGCTCCATATCCTGTCCCTGGAGGTGGAGAGCTTTCAGTTCATAT 

CAGTGATCGTACAGGTGCATCGTGCACATAAGCCTCGTTATCCCATGTGT 

CGAAGAAGATAGGTTCTGAAATGTGGAGCACATGTTGTTTAGGTATAAAA 

TCAGAAGGGCAGGCCTCGTGAGGCAAGGTGGCAAAATTTGATTTCTTGGA 

GGACACCTGAGCATATACGGTCAAAGTCTGATGACAACACCAGTAGGGAT 

GAAGC7GGGAGTGGGGTGGCTAAGAACACTGGACCTGACACTATTAGACA 

TGGGTTCCAGCTTCAGGTCTATTACTGCTCACTGTGGCCGAGCAACAGAG 

CTACTTAGGTAAAATGGTGATGGTCATAACACTAGCCCACAGGGAGGTTA 

CGAACCTCTGGTGACAATGTAAGTGAAAGGCCCCTGAGAAAGAGTGAGGG 

AGTTGCAAATGTCAGTAGCCATCAAGATCTTCTTTAAGAATAGTTTCCAC 

7AAAGAGATGATTGCTTTGGTTTCCAGCCTTCTTTGTTTTGTCTCCCCGC 

TGGGCCTTCTACCTTTAAAGGGCTTTGGCTCTGGGGGAATTGAGTTGGCT 

GGGGCTTGATGACTTCCAAGAGGACACAAGTGGAGATCTACTGCCTGCTC 

TTGGCTAACTACCTTCTTCAAAGATGAAGGGAAAGAAGGTGCTCAGGTCA 

TTCTCC7GGAAGGTCTGTGGGCAGGGAACCAGCATCTTCCTCAGCTTGTC 

CATGGCCACAACAACTGACGCGGCCTGCCTGAAGCCCTTGCTGTAGTGGT 

GGTCGGAGATTCGTAGCTGGATGCCGCCATCCAGAGGGCAGAGGTCCAGG 

TCCTGGAAGGAGCACTGCGGAGAGAGCGAGGGAGGGAGCCTGGTGAGGTG 

GT C CTGC CAGGAAC CATGCTTTGACATCAGAGAGTAGAAAGCTCAGAGAG 

GAGGAAAGGGCTTGAAAGAATCCCGAGCTTCTAAAGATCATCCCTCTCTG 

GGCCAGGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAAGCCGA 

GGTGGATGAATCATTTAGGTCAGGACTTCAAAACCAGCCTGGCCAACATG 

GCGAAACCCCTTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGG 

GTGCACCTGTAATCCTAGCTATTCAGGAGACTGAGGAAGGAGAATCGCTT 

GAACTCAGGAGGTGGAGGATGCAGTAAGCCAAGATTGTACCACTGCACTC 

CAGCCTGGGCAACAGAGTGAGACTCTGTCTCATAAAACAAAACAAAACAA 

AACAAAACAAAATAAAATAAAATAAAATAAAAAGATTATCC CTCTCTGAA 

GCTCAAGGAGGTTAAGGGTGTACTCAAGGGCACACAGCAGGTTAGAGGCA 
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GACTCAAGACTAGAATGTGGGCTTTCTCM^CACCTTACAGGCTATTCtTTT 

AGAATAAATCCCATTTC7ACTTTGTTCATCTTTTTTGTACATGCCCCACC 

TACACCATACATGTATACCTTCTCTATATCTTTTTGTATCCCTAATGCTG 

TCACACTATGATTTGCTTTTTCATGCAGATGACCATAACATTTTCCATTC 

ACCTATGCTCACTCAGCAAGTATTCAATTTTTCTACACTGTTCTTTTTTT 

TCCTTTTTCATAACACTGTCTCATAGGCATTCTGCAAATCCTGTGAGAGT 

ACJTTTTGTGAAATGTTACCACTTTCCTCTTATTCAGAGAAGCtCCGTAT 

TAAGGCTTCACTGAGGTTGCCTTAAGGCATGATAATGGTTCAAAGGCTTG 

AAAGACAGTTAAAGAGACCTGTAAGTGCACAAAAGAAAGTTGAGCAGGAG 

AGAATTTCTTGCCTGGAGCAGAGCCAAGCTACTGGAAGAGGCAATGGGGG 

CAAAGGCCAGGCAGACAAGCCAATGGGCTCCTCCCACAGCTGCAGCCAAC 

AAGTTATGCCAGTCTTAAAACTTCTAAAGAAATATGTTTTTAACAAGATT 

GAGGACTGGATTATGAGGCTAGGGGAGGCTATCACAAACTGGAATAAAAT 

AAAGCCAGAGAAAAGTGGCTGCCTTCCAACCTGCACAACTGACCTAGCTA 

GGCTGATGGCTGGGCCACCTAGGAAGGCTACTGAGCATCATATAAAACAG 

AAGGGACAGCAGGAATATAACATGGCTCTTTGTAAGGATGAGTCTGAAAA 

ATGACCATTTGCTGCCCAAATGCCCTTAGCTACAACTGAAAATATTTCAG 

AACTGGAGGTTGCAGGATGCTGGAATCTCAGAGATCATCCAGCTCAGCCC 

TTTATTTTTCAGATGAGGTCCAAAGCGGGTAAAATGACTTGTCAAGGTCA 

AACAGCAAGTGAATGGTTTTCTTTCAAGTCTCAATTCATCTTTTTGTTTA 

TATCATCTATGTCTTGTTGTTATAAGCTTCACCCCAGGTAGCAAAAAACT 

ATTCTACTCAAAAGGGGTAGACATATGTTAGTTCTCAAGATCATCTCTTG 

GTTTCAGAGTTTAACTCAAGTGATTGGCATAGGCTGAATCCATCTCTTAA 

AAGGATAATCAAATTTATGTTGAAGACTTGGTTGTCTTCCTACTATGAAA 

TGGGAAACATTATCACTACTCCTCCCCTGTCACCACCAAGTGTGGCCACC 

ACCACCAACGTTAGTGAGTGACTGTGGTGATATGATGACCAAGTGGCCAG 

GTCAGCAAGTGGTGCAGCCTGTGTCTCACTGGAAGAGGTTAAAGTCTTTC 

TAAAACAAAATACCATGGCATCAAAGTGGCCCAGAACTCCCTTCTTTGAG 

CTTTCCCTGTGTTAGAGCCCTTCCTTGGGTTGGGAGTTAAACCCATAGTC 

TTACCTTCATCTGTTTAGGGCCATCAGCTTCAAAGAACAAGTCATCCTCA 

TTGC C ACTGTAATAAAAACAGGGACATGTCTCAATT ATGTCTTCTAAACA 

GGTTTATTTTTCCTTCCCTGTGTACAAGACTTGACTGTTCATAAGAAACT 

GCAAACAGCCTGCCTCTCAAAGCTGCCTGAAACACCTGGCAAGTTTCACA 

GTGATATGCGCAGAACAGTCCAGAAGGCAGATTCTAGGCCTGGCAGGTGG 

GCACCCTGGGTGCTCCCTGTTGGATCTTGAGGCCTAACCTCTAGCCCAGC 

AGAGTCAGCTAAAATCTGAGCTCTCCCTCTCCCTCCAAGCCACACTTTGC 

AAAGGGATTCCTTGTATTGTGGGCTTGGAATCTTTTCTCCCCATTTGCCT 

CTGCAGGAAGCCCTTGCAACAACACATCTGGATAGCCTCCAGGTCCCAAG 

GCTGGAGGGACTTGTAATGGGAAAGTAGTCTTTAAATCAGAT7TACTTGG 

CACCCTGTTTGCCACTGAAAGAGGCAATTTAGGGGAAAAATCTGGTCTCC 

AAGCACAGATAACACTCTACTCTTGAAAGAGGAGACCTGCTCATGTTACT 

GGTCTCAGCGTCTCCACTGACCTGTAATAAGCCATCATTTCACTGGCGAG 

CTCAGGTACTTCTGCCATGGCTGCTTCAGACACCTGTGTAAAAAGGAGAA 

AATGAGTGACTTCCCCATGACGGCTACGTTCATGTGTGATTTCTCTCAGC 

ATCCAGTGCATGGCAGTCATGCAAAGAAATGATCTCTGAGTAAATGAATG 

AATGTGTGAAAGAGAAGTCCTTTGGGTCTAGAGAAAAGCATTTGCTAAAC 

CAAACCCCAACTAGCAATGTATTGGCTAGGAGAGCTGGAGCAGAGGCTTT 

GACACTAACCTTTAGGGTGTCAGCTGTTAGATAAGCAGTATCCATTCCCA 

GAATATTTCCCGAGTCATAAGCATTATATTACACCTGGCATTTTTGCAAA 

AAGCTGAGAGAGGGAGGCAGAGAGGGAAGGAGAGGGAGAGACAGAGAAAG 

AAAGAGAGAGAGAGAGAGAATATGCATACACACAAAGAGGCAGAGAGACA 

GAGAGACTCCCTTAGCACCTAGTTGTAAGGAAGATTAAAGTCATACTTGA 

GCAATGAAGATTGGCTGAAGAGAATCCCAGAGCAGCCTGTTGTGCCTTGT 

GCCTCGAAGAGGTTTGGTATCTGCCAGTTTCTCCCTCGCTGTTTTTATAG 

CTTTCAAAAGCAGAAGTAGGAGGCTGAGAAATTTCTCTGTTGAATACCTG 

ATTTCACAATCAAGTTAAAGGAAAGGGGAAAAGAGTATTGGTGGAAGCTT 

CTTAGGGGAGGGGACTAATAAACTGAGATAATTCTCTGGTTCATGGAAGG 

GCAAGGAGTAGCAAACTATGACACATTTTGCAAATGTATCACCATGCAAA 

TATGCATTGTTTTCCTGACAATCGTTGTGCAGTTGATGTCCACATTAAAA 

TACTGGATTTTCCCACGTTAGAAGAATGTTTAAATTTAGTATATGTGGGA 
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CAAAGTGGAAGACACACAGATTTATACX x GCACATACTTTTCTTCATTCA" 
CTTCTTTGTACTTAAGTTTAGGAATCTTCCCACTTACAGATGGATAAATG 
GGTACAATGAAGGGCCAATAGCCCTCCCTGTCTGTATTGAGGGTGTGGGT 
CTCTACCTTGGGTGCTGTTCTCTGCCTCGGGAGCTCTCTGTCAATTGCAG 
GAGCCTCTGAGGAGAAAATTGACCTTTCTTGGCTGGGGCAGAGAACATAC 
GGTATGCAGGGTTCAGGCTCCTGACGGAGTTGGGGCAACCCTGGAGATAA 
GCTCACACAACCCTGCAAGACCAGGTGCTGTTACCCTAGCCAATCTCATG 
GATGAACCAGATCAATGCCAGATGAGCTCTGCCTAAAATGATTTTTTGGT 
GAACTCTGAAAAGTGGAATATTGTTTCTGTAAGAATATCCATCTGAGACT 
CTATCTCTTGGTAATACCAAGAGTTATCAGTTTCTCTTTAACCGAGACAC 
CAGCAAAGTGCCTGCTCCAGGGTACTGCCCAGGGGAGCCCTCCATTTGTA 
GAATGAATGAGAGTCCAGGTTATGAACAGTGCCTGGAGTGTAGGAACACC 

TGAGACAGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGATC 

TCGGCCCCCTGCAAGTTCCGCCTCCCGGGTTCACACCATTCTCCTGCCTC 

AGCCTCCCCAGCAGCTGGGACTACAGGCACCTGCCGCCACGGCCGGCTAA 

TTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTAGCCAGGATGG 

TCTCGATCTCCTGACCTTGTGATCTGCCCGCCTCGGCCTCCCAAAGTGTT 

GGGATTACAGGCGTGAGCCACCGTGTCCAGCCTGTAACACTTCTTATAGC 

ACTGAGTTGAAACCTTGCTCCTCCTGGTTCCTCCAGGAAACTGAAATCTT 

TTTGAGCCAAGTCTAGCACAGTGCCTGGCATGTACATTCAGGTGGTAGAG 

TTTGCTGCTTGAATGGGTGAATGGGAATTTGACAGCATTTTTATTCAAAT 

TAGTATGTGCCAGGTATCGTGCTCGCTCTGCATTATCCAAGGGAGTGAGC 

CTCTGTGCAAGTATTTGAGACACGAGGGAAATAGGTTCTACTGTGGGAAA 

AAGAGCATTTCATGGACTTGCTCTCCAAGCAGCCTTCTGATTTTTAATTT 

GGCTCCCAGTATCTTGATATCAGGAGTCAGTCACAAGAACTCCATCTTTA 

GTAAGTTATATTTTCCACAGGAAATCTAAAAGCTGTTCAACATGTTAGTT 

TCCTGTGAATTTGATAAGCCATAATCCATTCCTAACACTGAGCCCTCCTG 

AAATTTGGTGTCTGGTCCTGCAGATAGCTAAAAGCCCTGTCTGGGTGGCC 

TAGGGGACTCCTCTGTTTTGCCTCCACAGGATCCACTTTGCAAATTAACC 

ACTGGTTCTCCCGTTGTAGGAACTGCCACCTTCCTCAGAGCCTGTCTTTC 

TTCCTTCCTTCCTTCCTTCCTCTTTCTTTTTCTTTCTCTCTCTCTTTCTT 

TCTTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTT 

TCTTTCTTTCTTCCTTTCTTTCTCTTTCTCTCTTTCTCTCTTTCTCTTTC 

TTTCTTTCTCTCTCCCTCCCTCCCTCTCTCTCTTTCTTTCTTTTTCTTTC 

TTTTCTCTTTTCTTTCTCTCTTTCTTTCTCCCTCCCTCTCTCTCTTTTTC 

TTTGTCTCTCCCTCCCTTCTCTCTCTCTTTCTCTTTCTCTCTCTCTCTCT 

C CTAGACAGGATCTACCTTTATC C CC CAGGCTGGAGTGCAGTGGTACAAT 

CATGCATTCATTGCATGATCACAGCAGCCTCAAACCCTTCCTCAGAGTCT 

TTATGCGGCAACCAGCAGGGTCTGGAGGGTTGGTGGCTCTGTGAACTCTC 

CTGACAGAACACAGAGATGTCTTTGGTCTGTTGATGTGATTACAAGCTGA 

ACGAAGGAGGATCAAAGCCAGTGACAGGAAGGGAGATATGCAAGGGACCC 

GAGCATCAGCTCTGAGTTAGTCCATTCTGCTTCTGGGACTTGGGATACAG 

GTCAGAAACCTTGAGCTTCTACTTCTCCATCTTCCAATTGTAGCATCCAG 

GACCTCAGAATCTGCCAGCTAAGAGGAGCCGTAATGATTGTCTGGTGGGA 

TATGGTGGGACCACAGAGATGAAGACATGAATAGCTATTTGAATGTGAAC 

AGCAGACGAAGAAATCAAGGCTAGGAGGGTGGAAGTGACTCATCCAATAG 

CACAGTGTGGTTGAAGCAGCACTAGTATCCAGGTTGCATGAGCCCCTGAT 

GCTTTCGCTCGAGGGAAATTTTGGAGCCATGGGGCAATGCCCCCTGACGT 

AACAGTCTCCACAGTTCTGCCATGTCTCATCCTGGCCCTGTAACCTGGAC 

CCAAATCTGCTACCATCCCATCCATCTCAGGAAGTGAAACCTCTTATGTC 

AAATAGGTTGTGCAACGTATGTATCAGATCCTGTCTTCCCAAGGAGACCG 

CTCAGGCCACAGCACTTCCTTCCGATCCCCAATGAGCAGAAAATATCTCG 

CTATAAACATAGTTGGCACTAAGGGAGGGAGTGGAAGAGTGATGATGATG 

TAGATGGTGATGTAGCCCCAAGGAAGTGGAACAAGCAGAGATGGGGAGCT 

GGAAATG C CAGGATG CTCCAGCTTTTGGGGAATTATTCAGCTCTTG AGT C 

ACTAAAGCCTTTCTCAGCTGCAAGTTCCTCTTTACCCTGTCAGGTCATTC 

TTCCAAGACAGGAGACTGACATTTATTCAAAGCAGCAAGTGCCCTGATAC 

ZATCTTG7GTCTAATCATGGGCTTCGCAGCCAGTTATCAAGGTTGATCTC 

ATCTCATTGGTCTTCAATCATTTTGAACAAGAAGACAAGCAAAATAATCA 
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tgggttagttcttatattattgtgtgtacatgcagtgatgtctgttcttt 

gtagtgagctgttccttccttgttcaccctcttgcttagaacagaactaa 

gcaatctgcccccaacattttccccaatttcccatctcattcttggcact 

ggcttcctaatatttgttcttatgagtcattttcttgtatcatttccatg 

agtccctctgggatcttaaagtatgaaaaatgttgtgtgtacccacacct 

gtctttgtggatatttctctcctttcccttctgcttctgggattatttgg 

gaa^gggcactatgatttttatcatatcgcttccacttcctttatggcat 

catctccaatgggcttcttctccctcttggatccaggttctcagattggg 

gacatgcagagtccaaggaacattccattctcctccctggtctagaacaa 

ggagggcttagatatatgagcaggtggctggggctggcgagctatgtagt 

ctccaatggcttttccctgatgtcggagttgttatgtcagttctgggaga 

ccaataagaccttgtccttcctttggatccatcagaaaaagcccctgggt 

gggtaagatggatggcagggctctcctactctatgtcttttctcacacct 

agtgggtataagagaggggaccacaaacagagggggctctggtaccactt 

atccagggtctggaaacattttctgtaaagggccagataataaatgtttc 

aggtacaactactcaaccttgcatcatttcagaaaagcagtckgataata 

cataaatgaatgggtgtggctggacttgtcctgcggtcccctgtcttata 

tcattgtattatatcattttttcttacatacaaatttagaagcaatactt 

aaaaaaaaaaagccgtcctttattgagcacctactaagtgccaggtacct 

ttttttccctcattatcttattaactcttcataataacctttaaagtaga 

taatattgaaccatttgacctatgcagaaactgaggttgagagaatIaaat 

tatttaagaccgcacaaacagtaaatgctggaactacgactcaaatatgg 

gttaactgaaccaaaaccagatctttatttctcacttttaattgttagat 

atgtttattgcctcatctcctgtccacatggtgcccatcggcagactcct 

ttctcattctcagtgattgagtgacattctaaactacattggcctggcag 

attcacctctgtcccctaaatgtttccacattgtccttttaggattgaga 

tcctctctgttcccttgtcttccctcctttcttcttctggcggtgacgtg 

ctgtgtgaatttgtttctttctcctctcagggtagtactgggactttcca 

aatcagggtttttaatgatctctcttcncttttctgaatttcttccttat 

tcccattcactttctcatctataagtggcanctttgttgctggaagatat 

cccttgtgcagggattnctctttaanaatttgtcnnnacc 

>Contig54 

GTGATCGTCAACCTCCCACCCTGTAGGGCCTCAAGCATTGAGGACAATCA 

CTGGCTGCCCATTAACCCAGAAATGTTGCCGAGACAGGAGGCCGTGGCCC 

AAGTTCCTGGAATGGGGTATTATTATGTCAGCACAAAGGCCTTTGCACAA 

ATGAAGGCTTTAAAAATGCAGTCCTAGTCAGGTGGAGGAGGGCTTATAGG 

ATTCCCAGGAATCTGGATCATTCTCTTGAGAGCTTTCCCTTGTCTCTGTT 

AAAACTCACATCGTACGGCCCAAATAACAACAAAAAATGGATGTAAATTC 

TTGAAATAACTTGTGGATGGGGGAACAAGGCCCACCCCCCAGATCTGCCA 

G AAG CTT CAGGTGAGGGTC C CAAAT G C CAAAAAGT C TGGT AT C AGAG AGG 

ATGGCCAGTGACNTGGGGACACATGCCCTTTGCTGTGTCACTCAAGGAGC 

AGCAGCTTCGGCCCCGCACAGTGACCAGGACCCTGGCTTCCCACGCTGGG 

CAGGAGCTGGTGTCTGATGAAGGGAATGCCTGGCAGCACGTGCTGTCTGT 

CTCCTCGTGTCAGCTTACCTGGCTTTGCTGCGAAGAGGCCACTTGCATTT 

CTTTATTTTTTATATTTTTTTAATTTTTTAAATTTTTTATTTTATTTTTA 

TTTTTATTTATTTATTTATTTTTAATTTTTTTTTAATTTTTTAAATTATG 

CTTTAAGTTTTAGGGTACATGTGCACATTGTGCAGGTTAGTTACATACGC 

ATACATGCGCCATGCTGGTGCGCTGCACCCACTAACTCGTCATCTAGCAT 

TAGGTATATCTCCCAGGTTAATCCCTCCCCCCCTCCCCCCACCCCACAAC 

AGTCCCCAGAATGTGATGTTCCCCTTCCTGTGTCCATGTGATCTCATTGA 

ATTTCTTTAAAGGTGGAATCTCTCAGTGGGGTCTAATCTGTTCAGAAATA 

TCAAAAGAGTATCCTTGGGAATGACTGGAATTCCAGAGTCATCTGGTAAT 

CCTCATAAAACAACTCCTGGATGTCTCTCAGCACATCTCCCACCTTGAAC 

GCAGGAGGCTGGTTCAAATGGAGGAGCATCGCTCTACTGCACTTTTTTTT 

TTTTTTGGCCTAAAGTGCAAAAGGGGATACGTTTCATGTAAATAAATCAA 

CTGCAAATCGCTAGTTATGCTGAGCCCTGTCCCGTGCTGTGGACACAAAG 

GAACCAAAGGCTTTTCTCCCCGCCCAACACACACATAACACACACACAAA 

ATCATAAAAACATACATAC C C C CAACAC AT AACAACAC AC AACACACACA 

CAAAATATATACACACAACACACACCAAACATGCCCACAAACCTGTGTCC 

AAAAATAAATCCTACTGGTGGGTTTGTGGTCTCCCTAACTTCAAAAATGA 
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AGCCGTGGACCTTCGCACiTGAGTGTTACAGCTCTTAAAGATGGCATGGAT 

CCAAAGAGTGAGCAGTAGCAACGTTTACTGTGAAGAGCAAAAGGACAAAG 

CTTCCACAACCCAGAAGGGGACCCCAGCAGGGTTGCTGGTTGGGGTGGCC 

AGCTTTTACTTCCTTTTGGCCCCTCCCATGTTCTGTTTCCATCCTATCAG 

AGTGCCCTTTTTTCAATCCTCCCTGTGATTGGCTACTTTTAGAATCCTGC 

TG ATTGGTGC ATTTTACAGAGTGCTG ATTGGTGCGTTTTACAATC C C CTT 

G'PAAGACAGAAAAGTTCCTGATTGGTGTGTTTTACAATCCTCTTGTAAGA 

CAGAAAAGTTCCCCAAGTCCCCACTGGACCCAGGAAGTCCACCTGGCCTC 

AC CTTTCAACTC CATAATGGCATGAAAATACATATGTTGTACAAAAC ATA 

C AT ACAC AAAGTATACATG CAT CT C C C CAAAT AT AC ACATACCAC AGAAA 

CATACACACAGGAACTCAGCTACCTGTCAAAAGTCTGCATGGTGATTGCC 

T CTG C AGTGAGT AGTTAGAAAAGTGAATTTGTTTTT CAATAAATTGGAGT 

CCTTAAAAATCGTTGTAAGATAGAAAATTTTTAAAAGTATATAAAATAAA 

ATATGTATGTCCTTTGGTCTAGCATTTACACATGTAGGAATTTATCCTAG 

TGGAGTAATCAATGATATATGCAAAGATTTGGACAAGCATATTAAGCACA 

GAATTATGTATGCATATGTGTGTGTATATATATATATATCTGATACATAT 

AATAATGTAAAAGTGAAAATAACTCAGATGTTCAAAATTGAGGATTAGTT 

AGACTATGATCTGTCCATATGTGACATACAAGTTAGCTGCCCCTTATTCT 

CTCGAGCTTCAACCTCCTATAAACAGTGTCCCTTGTATATCAGTATTGGT 

ACAGATAATCGAACTTATTGAGGTTTTACATGGGGCAATAAAGGCAAGAG 

TTT ATG AAT A C T C CATACTACACTAGGTAGC AC C C C CT ATTAAAG ACAAA 

CTCTTCTCTCTCATTTCCCTTCCTTTCCGGAACCACTTGGTTGAATCTCT 

ACAAGTCTCTATTGCAACTGCCTCAACATGGCACCCTCCCTGCATCTCCA 

TCTTCCCTGTCCTGAGAGCAATGGCCTGCTGCCCCCACACTCACATCCTC 

ATTCATTCCAGAAGTGAGCACCACAGAAGTGCCTACAGTTACCCCAACCA 

CCTTCTTAGAAGATAAGTTAGTGTTTGTTTTGACTTTTTAAAATTTTTAC 

TTCCTCTTTTCCTTCACAATCTCATCCCATCCCAAGAGGTTTATCAAGAA 

GTTCTCTAAAGATATGTGTCTCCTTATGGAATTTAACAGAAATCAGGGAT 

TTGTATTCTAGCCATCAAGGGAATAACATTTTTCCAGGTCTTTAGACAAA 

TAATGGAATACCTTGCAGTAATTAGATACACTATTGTAGAAAAGTATTGA 

TGAAATGGAACGATGTTTGAGATATCATATTGAGTAGAAAAGGCAAGATA 

CATTAAGTAGGAAATGTATCTTACAAAATAATTTGTCAGACACACTCCTA 

TATTTGTATGTTATATAAATGCGTATGTGAAGAAAGGCTAGAGGATGAGA 

CCACAGTCTTCGGTGAAGTTTAAGAGATGAGGCTGCAGCATGCTCAGAAA 

GGCCTGGGTTATAGTTCTTCCAGTAATTAAGGATGTGATCTTGGGTAAAT 

TGTCCATCCTCTCTAAACTGCACCACCTTTTGTCTGTAAAACAGGAAGGA 

TGGT ATTTAC C C C CAGGGTC ATCAAAGGATTTGGTTGGAGAAAAATAAAT 

AAATGGGCTGAGCCCAGACCTGGCACAGTGAGAGCACAGTGGTTGACTAT 

TGTGCTGGCCTGTTGTTCCTGTGTTATTGACATGCTGCTGGTGGTGGTCC 

AGAAGCTATTACCTTAATTGGTTATGTGGATTTCCCCTCATACTGAGCAG 

CTGTGTGTGGTGTTGTAAAACATAGCCATACACAGTAACTGACAAGGGCA 

AATGTGATGGAAAAATGCAAGGAAGTGCAGATAAATAGCTAATGGGCTGT 

AGAAGGAAGCTAGTCCTTGGAGGGCTTGATCAAGGAAGGTCCTTTTGCAT 

GTCACCTTTGAAGAAGAGGGGACATAGAAGAGGTATAGTGCATCCCGGAG 

TGTACCTGGAAGGGAACATGAAAAGAGGACATTTTTCTCTGGGACATGGG 

GACTCCACTTGCATGAACTCTGGAATTGGGGCAAAGAACCATCATGAGAA 

CAAGGGCTTCCTTGAACCTCCCAGGCTCATTGGCTGATCTAAACCCTGTG 

TCCCCTCTTTCCTTCACTCTCCTCTGTTTTCTATACCTGTATTATTGGAC 

TGGACTGGAAGC CAC CTGAT CTATCAC AAGT AC CTTGAAATGTGTTGAAT 

AGGTGTGGCACAGTCCTTAGCAGAGTGGCACTACCCCCACAGGAATTTGT 

TTATACCTTTGGCATGGAAAATAGCAGGAAATGAGTGATCACTGATAACT 

GAGGATGCTATTTATTATTGGCCAAAGGAATACTTGTGTTGTATTTGCAT 

AACCACTCACAAACTGTTGATTACAAATGAGTACCAGACCTAGCTCCTTC 

AAGTAAAGGATCCTGAGAACTGAAGGCAAACAGAGCTCCAGGAGTCCAAG 

ACAGAGCCACAGACCACGAGGATCCCTGGCCCAGGTAGGTGGTCCTCCTG 

CACTGGCTTTCAAGGCCAACAGGATGGATGGGGAAGTAGAGTAGCATCTG 

GCCATCTAGACCCTTGCTTTTTATCCCCACTGGAAGCACATCTGAATTTC 

TAAATATGATCTCTGAGACCTGCCCAGAACACCTTGCTCTCAGCCCCAGT 

AGCAGCCTGCTCTCTCCCAGGAGGGCTTCCACTAACAAGTAGGGCATTGC 

TGGAGGGCCAGGCAGACACTAGCTTAGGAAATCCACCAACCCTGGAAATG 
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CTAGTCCCTTCTCTGAAGGCTCAGAAGACTGACTTTAGAGTCTAGAAAAT 

ATTGGTCCTTGGGAACAGATTTTGAGTGCAAAGAGATGGACTTCAGATGG 

CCAGATGCACTGCTTCTTTAGGGAATTCTGTGAAAGCTCCCTGCATTTAT 

CTTAATACAGGCAGCAGATTTCATGAGTACCCCCGAGGGATGGCCCCAGG 

TCCTCCAGCC7G7GAGCATCCT7C7G7CC77CAGCAGCACCACAG7A7C7 

TTATATGTCTTTGGATACCTACGTTTCTGCCAGACATCTCTTGCTCTGAT 

GTTCTGGCTGCCAAATTCTCTGTCAAGCGCCTCCAATTTTTTGfGTCCTT 

TGATTTACCCCAACATGACAAAGGCAGTTGTGCTTCATGTATTCAGGGAT 

ACTGCCAAACCACAAACAGGTTAAAATCAAATAGCAGATATCCCTGTTCC 

TAAAGACCCATCAGCTCTACCCACCTGCTCCTGCTCACCGTCCTTATTGT 

TGAGTCCTGAAGCCCTTCCTTGTCATTTTTATTTTTTGCATGAACAATTT 

AGTTCC CT77G7CTCAC7CCTAAACC77TCTCAAAGGA77GGA77TG7AC 

ACAAAC7GCC7A7C7C7GCAATCT7AGAAGTGA7A7GATTC7GAACAAAT 

CACTTAACTTTTGATTTTTTATTGGTAAGATGGGAATACCAATTTTTGCT 

CCACTTC7G7CCTATGT7GGCC7GGGC7GA7G77GAAAGC7C7CGG7CAA 

CTGAGATAGGGTGTGCAGAATTTATATATATAAATATATCT€CTCCAACC 

CCTCCCAATGAAGCAAGTCACGTGAGTCAATCCTACCCTAAGATATTAGG 

GATTGAGCCTCCTGGGACATTTGGTGGCTTAGGTTTTCATGAAAAGAGGT 

TGCAGAGCAACTGCTTTTTGTTAGGCAAAGATTAGGCTACTGCAGAGACT 

CAGCAAACTTCTATAGAAGGTGTCAGATGGTAAGTATTTTAGGCTTTGCT 

TGCCAGATGATCTCTCAACTAGTTAACCATGCTATTGTAGCCTCGAAGCA 

GCCAGAGACAATATGTAAACAAGAGCATGGCTGTGTTTCAATAAAACTTT 

ATTTAAAAAAACAGTCAGGGACCGGATTTGGCCAAAGGCCATAGTGTGCC 

AGC C C C AAG ACT AGAGCAATGCACTTTTAACTTTTTT ATTTT ATTTTTGT 

AAAATGCCAAGATCCACAAAAATGCTATTGCACCCCGTGTGTTAGCACTG 

TGACTCAAGGTTTGGGAAATTCTGCTTTGAAGGCGTGATAGACAGGAGAG 

CATGGTCTGGCCCCTTGGTGCCTTTCTGGTTGCAGCGAGCATTTCAAACT 

ACAGAGCAAGGCCAGTGGTCTGTTCAGCACTAGAGACATGCAGCAAGGTG 

TCCTGGGGTGAGAAGATGCCATAACTGGTCCCCTTTCTATCTCCTTAGGT 

CTTGGACTTCATTCCATTTTCTGTTGAGTAATAAACTGAACGTTGAAAAT 

GTCCTTTGTGGGGGAGAACTCAGGAGTGAAAATGGGCTCTGAGGACTGGG 

AAAAAGATGAACCCCAGTGCTGCTTAGAAGGTAAGGTTCTTGTAGAAATC 

TACCTCAGGGCCAAAGTGTAATTCCTAGAGCAGAACTTTGCTAGGTGCTG 

TGCACAGACCCAGTTGTTTCCTGCTGACTTGCACAGTAAGTGAGCTTTCA 

AATTTCCCTGGACAAATAACTAGACAAGAGAAATTCTGGAAGAGAAAAGG 

AAGCTTTGCTTCAGTGTCCAGGCACATCAGGTAGTAGATAAAAGGATCGT 

CCTCACCTACAGATTTGGGGCTTTAGCATCCTGTTTGCCAACTGGATGGT 

TGCA7A7GC7TCAAAA7GCACCTCTTCCCTCCCAACAT7CCCAAG7GGAA 

GAGAAGCCTCCGATGAGAAGGAAC7C7C7AAGGC7GGGCTGAACAAA7GA 

CCCAGGCACAGGGCATCTGAGTATTCCATGAGGAACACATTTGGGTGTTG 

CCCATGGGGGACAATAGGAGGAGGCTTTTGACCCAAATGATTGTCTACTG 

AGGTGTGACGGGAGAGGCCTGTGACATGCCAGAGGCCAAACCCGTGATCC 

AGTTCATCTCTATTCTATGTTTCTGAAGAGGGAAGCTATGATTTAATGTC 

ATTACTATCATGCTGCTCTAGTATTTCTCAGCACATACACAGAAGAGGGA 

ATTAAATGGTCCTTGATACCCCTAAATCCTTGGAAAATCCGAATTGCATA 

TGCTAACCTCACTGCGTCTGACTGCAGACCCGGCTGTAAGCCCCCTGGAA 

CCAGGCCCAAGCCTCCCCGCCATGAATTfTGTTCACACAAGTAAGGCCTC 

GGGGTGAGGTGATGGGGGTGGCTGAGGTGCGAGGGTGGGGATGGGGGATG 

GAGCCATTGGGTCCTCTTACAGGGTGAGAGAATTGTAGAATGGGGACACC 

TAAGGGTGCTGGATGGGGCTGAAGTCTTTCCTTTGTGGAAGCAAATCCCA 

TTAGGAGATAACTCTGGGAAAGATGAGCCCGGGGAGGGGCAGGTGATGCT 

CACCTGCTAAGAGGCAAAGGGCAAGGAAGAGTTTGTGCCTGGGAACCTTC 

CAGGTGCCTCTTCTGACCATAGCCAAGAGACTGGAGACACAGACCTCCTC 

CCAGCACTGAGGACAAACAGCCATGGGGCCAGTGGGGGTGCAGGGACACC 

CACACCACTAAGGGCTCAGGGCGGCGCCTTCAGAGCCTGAACCTTCCTCT 

CATGCTGCCATTTGAACACCACAACACCCTAATAGGAAACTGTTAACATT 

GCCACTGT7CAGGTGTGGAAACCGAGACAGACAGTGGAGATTCCCTGCCC 

TAGGTGACACAGGTAATAAGTGACAGATGTGGAAATTTAAAGGTACTATA 

\CG7CTGTCTGCC7GAC7CAGGC77AAGGCTCCCATCACC7CC7C7TC7C 

AGGACAGAG7CAGGAGGCCTCAGCCTGAGCCCCAGCTCTAGTGCAGGTTC 
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ATGTGGGAATACTGAGCu rCACTAGTAC^nATGG CAGAGAGGAC CAAATGG 

GACCAGGTGTGTAAGGGTGCCTGGCACAGTTGGGGGAGGCTGCTGTCGCT 

TCTCCACCGCTGCTGCTGCAGTTACCTTTGATGTTTTAGTTTTGTTGTAG 

TTACACCATTGCTGGCTTTGGATCTGCACTGTGTCCACTCCAGGTGGAAC 

CACGCACACAAGCCTCTCTGTCGGGCCTGTCCTGACTTCTCCTTGTCAGG 

GCTGGGATCTCCTTCAAATCTGGCGGAAGTGG7TCTCCAAGTCTGGTCCT 

CAPlACGTCAGCAGCATCAGCGCCTAGAAGTGTTAGGAATACACATTCCCA 

GGCCCCACCACAGACCTCCTGCCTCAGAAACTCAGGGCGCTGAGGCTCTA 

GGGGCTGCTTTAACAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTC 

TGAGAGCTACTGCCCTACAGAAAGTTACTAGTGCCCTAAAGCTGGCGCTG 

GCACTGATGTTACTGCTGCTGTTGGAGTACAACTTCCCTATAGAAAACAA 

CTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGGCCTTGAGAAAGA 

TTTGGGGTCAAGGATCATGAGCGAGAACACCACTTAAGAGGATAGTGAAC 

TAGTCTGCATGTGAGACGCTGAGATCCTATGTCAGGCTGTGATAGGAGGG 

AAACAGAAACCAAAGGAAAGAACAGCTTTAAGAAGCGCTTAAGAGGTACA 

AAGTAAAATGATGGTGCTAGAAAAGTAGCTTCTTAAAAAGAGCATTTTCC 

AGTCTCACCCTGGACTAACTGAATGAGAATCTCAGGAGTGTGAGGCCCAG 

GTATCCATGGTCTTAAAATGCCACCCACCAGGTGATTCCCAGTGTGCACC 

AGGGGTGAGAGTCACAGCCTTAGGCCATGCCACTCAAAGGGTGTCTTCAG 

ACCAGCAGCACCCACAGCTCTGGGAGTGCATCAGAAAGACAGAGGCTTGG 

CACCACCCACACCTACTGAACCATAGTTTGCAGGTGATTTCTTGCACATT 

AAAGTGTGGGAAATGGAAAAGCTTAGAGTTCAGCTAGCTCGGTGACTCTC 

AGTCAACCTGCACCTGCTCCATGAACTCAGACTGCCTGGGATGGGCCCAG 

AAAAGCTCCTGAGGAGATTCTGATGTAAGGCAGGGCTGATAACCATGGAT 

CTCATCTGACCCCATATCACTGGGGAGTTACTTAGGATCTTGCCTGGGGC 

CAGTCATCTCTTCCATAGACACTGAGAGTGTCCACGATGCTTGGGGCACT 

ACAGGGTGGGAGGTGGAGGATCACGGGTGAGTCAGATAGGAAGCCTGCTC 

CTGGGGAGCTTACAGTGCTATAGGGCAGCAAGCCAAGGATGCCAATACCT 

GTGTGCAGGTACCACTGACGAGTGCAGAGCGCTGCAGCACCAGAGAGGAA 

GCTACCCTGTGCAGAGGGGGCTGAGGAGGGCTGCAGGGAGATGACAGGAA 

AGCCGGTGTTACAGGAGGAGTCCTCCCCACTCTTTGGGCATGAGGAGACC 

AGGAGGACATTCTACAGTGAGAAACCCAGGCAGAGGCCATGTGCTTATGG 

CATGGGAAAAGAATGACACCTTAGACTTATTCTCTACATTAGAATTGCCT 

ACCACAGATACCCATATTATAGCTTCACATAGTGTGGTGGTTACTGTGTT 

TTCATATTGTCACATTTGCCATTTTCCAGCCACCCACCCATTCTTGACAG 

TCACTGGCCCAGCCTGGGGGCCCCTGTTCTTTATCAAACAAGTGCCTGAG 

CTCTTTGCAGAGGTGAGGGTCACCTGTCCAATCAGAGGCCAGGAGGGAAC 

GTTCCCTTTTAAGACCCTACTCTAGGCAGGCCTGGCCCAAATGAGTTGCT 

AGGAGCCCACGCCCTAAGAACCCTCTGAGCACTGTTGTGGCTGGTCCTGC 

TGCTAGAAGTTGTTCCTCCAGGGCCAGGTGCAAGATTTGTGGCTTTTCAA 

AGGAGCCACTAAAGCTCCAGCTCAGCCTTGCACGGTGCTGGGCTCCTGGG 

GGCTTCCTGCCTCCAACCCTCCCAACTCTTCCATCACCGCTCCCTTAGCC 

TGGCCAGTGCAGGGATCTGTTCCACTCTAGGCACTGCTGAGGGAATGATG 

CCTCCAGTCAGAGGGTGCAAAAAAGAGAGTTAAGAAAAACAATGATTATA 

AAAAGTCCTTTTTATACGCCAGACATTTTCTTTGCTCAGGCTAAGTGCTA 

CTTATTTGAGTAAGC ATT TT A GTTCTCATAACTCCTCTCTCAAGTAGGTG 

CTGCTATTACTTTCATTTCACAGATGAGGACATTGAGGTTTGGAGAGACT 

TAGTAACTTGTCCTCTGTCCTACAGCAGAGCTGGGATTTGAATCTATCTG 

TCCAAATCTGGAACCCATTTGCTTGCACAGAAAGCTTAATTGCTTGTCCC 

AGCAAGATAGAAAGCCTGGGAGTGGAAGAAATATTCAGTGGCTGTGATGT 

CTGAGCCCACAGGCAGGGTGGAGAGCTAGGGCTGGGGCCCTTGGACGTGG 

GGAAGAAAGGGCTGAGTCTTCCATTTTCAATGTGAAGTGTTGATATCTGG 

TGATATTGATCTAGGTCCAAAGGTGAAGAACTTAAACCCGAAGAAATTCA 

GCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTC 

ATAGCAGTTCCAGATAAAAACTACATACGCCCAGGTGACTCTCAGTTTTG 

GCTGTGTTTTCTGCCTCCACCTAGCAGGGGTAAGGCCTCCTGCTAGGTGG 

GCTCAACTCCATGCTATACCATGCCCCATCTCCAGCAGGTGGTGGAAGCG 

AGGAGGAGAGGCCCCAGGGACTAGGGCATCAGATGAAGGGTCTCTAGCAA 

TGACCAGATCTGAAAGTAGTCTTTCTGGAAGGGCTGGAGAAAAAGAAGGA 

GGCAGACACTTAGACTGGAAGAAGAGGAGGCTTAAACCGGTGTGATGGAG 
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GGAGAAGTGGACCACAGAGTCAAGGGAGAGGGACTGTGCATCfiGGCCTGA'' 

AACCCCAGCAGACAGGAGAGACCTTTCCCTGCTCTCAGAACCCACACATG 

^TCTGACTGTCTTTTTCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGA 

GCTCAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAA 

GGGGAGTTTTGTCTCTACTGTGACAAGGATAAAGGACAAAGTCATCCATC 

CCTTCAGCTGAAGGTGAGAGTTCTAGCTCAGTTTCCTGGGCCTTTGGCTA 

CCC CAAAGTAAAAGGCCAAGATCCTCAATGCCTCTCGCTTTCCTGCAAAT 

TCTTATCTTGGCCAATATAACAGGGACATCCACCTTTCTGGAAGCACCAG 

GCAGAAGAGCCCCATAACTTCTTCTCTGGTTCCTTGCCCCTTCTAGGGAA 

GGAGGAGAGACTCCTCACAGCGGGGAGACAGCAAGGAGCTGAGCACCTGT 

T'CTCCTCTCCTGGGCTCACTGGTCCTGGCCCTGGGCGGGTGGCGGTCCCC 

TCCTGCTGTGGCCCTCCATGTGGCAAGCAACACAATTGGGCCAGGACCCT 

GGCGTGCTGCTGTAGGGTAGGAGGGTGTGAGGGAGCACTCGGAGGGCAGT 

GTGTCTGCCCTGCAAATTTAGTCCTGGATGGAGCATCCTTTCACTTGAGG 

GGAGAAATCTTAGGAAGCTGAATTAGATACAGATCTAAGCCATATTCTCT 

AATTTTAAAAACTATAGAGCTGAGATTTTGGTATCCATCTGACTCTTACG 

TCTCTCTCTCTCTCTCTCTCTCTCTCAGTTTATTTTTAATCTGGGGGACA 

AGAAGGCCTGGAAAAGAGGGCATGATTGCTTATCATCCCTTAAATACCAG 

^ACCAAGGCTGACACGTCATCTTTCCCAAGGACCATCTGCCTTCTCTCTT 

TTCCTCCTCTCCTGTGTAAAGGCCTGGAGGATGAGCACATGTGCTGTGTT 

TTCCTCCCTCTCAAAGCCTGTGCTATCTAATTAATCCCTTTTACCTCACA 

GAAGGAGAAACTGATGAAGCTGGCTGCCCAAAAGGAATCAGCACGCCGGC 

CCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGAACATGCTGGAGTCG 

GCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCC 

TGTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCAT 

TTCAACCAGTTTGCAAAGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAG 

GAAACTGCCCCATTGAACGCCTTCCTCGCTAATTTGAACTAATTGTATAA 

AAACACCAAACCTGCTCACTAAACTTTCTGTCATTGGGTTTCATTTCTCA 

TTCATGCTTTAAGGATTTGTGTTTTTAGGATATAGCAAGAAGCTTGTTTA 

ATTACAAAGTTCTGGGTTGGAAAGAGACCGGCTTCTGCTTGTGTACTGCT 

ACCCTGAACCATCAGACATGCATGTGTGTGTCATATGCTATGATGTGGCC 

AGTCTGAGTGCAATACTTGCAGCGGGAAGGAGCAGCTGGGTGCATGCTGT 

GCTCTAGAATTAGTCTTTCCTACTGGGGTTTGGTAGATTCTGAGGGCATT 

GATCCTGGGGCAGAAGTGGCTGAGTCTGTGTCTAGGGTACAGTGTGCAAG 

AAAGAAATGTAACAGCAAGTCACAATCCAGCCAAGTGATAGTGGAAAAGG 

GGTAGTTAGGTCCCAGATAAGGAGCAGGGTGACTTGACCTGTGGGAAAGG 

CACAGAGACAAGGAATCTGGGTCAGATGACAGCCAGGAGACCAGGTGAGG 

GAGGAGCCAGGTACTGTCTGGGAGGCTTGTCAACAAGGGCATGGTCCTAT 

CACTAAGCAGGGCTCAGATCCTCATAATGGGGGAGTGGAAGGCTGGCCGA 

ACAGAAATCAGGGCCTGGAAACAGAGTGAGGGGGTGGAGACAGGAGACTG 

AGGCTTGGAAATTAGTTTATTAGTTTTAGCTCTTCAGTTACAAGCAATAA 

TAATAGCTTCTAGCTTATTTAAGCAACAAGTATACTACAAAAGGAGCTTT 

CTAGAAGGATATTGGGTATATTCATTTCTTACTGCTGCTGTAACAAATTA 

CCACCAACTTAGTGGTTTAAACAATGCAATGTATTATCTTGCAGTTATGG 

AGGTCAGTCTGGAATGTGTCTCACTGGGCCAAAATCAAAGTATCAGCAGG 

ATAGCATTGCTTTGGGAGGCTCTAGGGGAGAGTCAATTTCCTTGCCTTTT 

CCAGCTTCCAGAGGCCACCTGCATTCCTTGGCTAGTGGCCCACTCCCATC 

TTCGCTGCTTGGGTTTTTCTCACACTGCTTTGCTCTGACCCTCCTGCCTT 

CCTCTTTCACATATAAGAACGCTTGCAATTTACATCGGGCTCACGTCAAT 

ATCCAGGATACTCTCCCGTCTCAAAGAGGCTTAACTTTAATCACAGATGC 

AAAGTCCCTTTTGCTATGTCATGTAACATATACACAGGGTCTGGGGATTA 

GAATGTGGACATTTTCGGGGTGCCATTATTCTGCCTATCATGTGAAGTAA 

CTTTCAAAATGGAAAGACATGCTGAAGAAAAAGTCAGGGATTTCTGGCAG 

GCCAGAAATGACAGAAGGCAGAAAACGTTGGTCCCATCACTCAGATGGGT 

AAGAGCCAATCATGCTTTTTGTCAGTTAGCAAAAGATTGAGATTCCAAGC 

AAAGCATGCAACTGCCCTAGTTTGGGTCATGTGTCGACTCCTTGGTCAGT 

GAAGGGCAGCACACCTTGATCAATACTCCCTCCAAGACTGTATCCAACGA 

GGCCAG7GATGTTCCTCAAAGCAGAGCTAGAGAGCTAATCCCAGGAGAGA 

r.GCG7GTGGGTGGTGGGCAGGAAGACAAAGCTCAGCCGTAAAGGAGTAGT 

AGGGACAGCACCCTAGGCATGGAGGCTCAAGTGAGATGATACCCATGGGA 
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AAAGCTCTGATAAGGTCi%JCTCCTTCTGTTTCTGATCCTGATGGTGATGU ,! 

TGATCAACACCAGCCCAGTGACAAAAAAGTACATAGTATATTTAGTAGAT 

GTTT C C C A CACAG AG AAATGGT AAAT AT T C AAGG C G AGG AAT AC T C CAAA 

CATCCTACCTTGATCATTACACATTCCGTGCATGTAATGAGTACTTGCAT 

GTATGCCATAAATATGTGAAATATTATGTATCACTATATAAAAGAAAAAA 

AAATGTGGCCAGGTGACATCCATATTTTGGAGAGGAAGGCATGTCTTCTT 

CATAATATCACAAAACTATTTTCACAACAAAGACACAGCTGTTCAAATTA 

GTCTCTGAGCCGGGGCTGTCTCATGGCAGTGAGGACTCTGGTTCCCTTAC 

AGACTAGCAGAAAGGAGATGGGGCTTACTGACCATGGCCTTGAGGAGGCT 

GAACATGCAGGCCAAATGGAGACACAGACAGCCTGGGCTTGGTCCTGCTC 

CAT C C C CTT C CAAC CTGATGAGAT ATAGTGAGTCACTATGACGTGGGT CA 

CTCATGCTTCCTGTGAGGCTCCACCAAGACAGCAAGTGCATCAACACCTT 

ACGGAAGCACAAGGCCCTGTTTGTTGTTGACTTCATGAAAGGCATGGTTG 

TGGTGATCGCATTGAGTAGGCTTTTGGGTGAGAGGTGAAAAACCCCAACT 

ATCATGCATTGCAGCCCTCTGGTGGAAACTGTGCTTCAGGCTCTAAATTT 

CAGGCTCTAGACTGACTCCAGGATGAGTATTTGGAAGCTGAAGTCAATCT 

GTGGTCTCTTCTCCTGTAGAGCAGGAGTCAGCACTTTTCATAGAGTGCCA 

GATTCTATATATCCTGCCACATGCTCTGTTGTTACAGAACAAAGAAGGCC 

ATAGACAGCATGGCTGTGTTGGCAAATACACAAAACAGGCAATAAGCTGT 

ATTTGGCCTTTAGGCTGCAGTTTGCCAACCCCTGCACTAACACAGAGCTT 

AAAGGTGGTGGTGGTGTGCTGGAGCTAGCTTATATCAGCTTGCAATAGCC 

AATTGCTAACATCTCTTCCAAACTCTGTGTCTGTGCCTTGATGTTGATAG 

TTTGAAATTGGCTACCCCATTTAATGCTGCAATCTTTTCTCACCCCAGCA 

CTACTGACTCCCCTTTGCCCTGTCTTATTTTTCTCACTCTAACATGCTGT 

ATAGTTTTCTTCTTACATTTATTGTTTGTGTCTTCCACTAGCATGTATGT 

CCCACAAGTTCTTTGCTCTGTGATGTATCCCAAGAACCCACTGCAGTGCT 

TGGCACTTGTAGGAACTCCATAAGATTTTTATAAATGAAGAAAGGAAGAA 

AAAAGAGAGGGAGGGAAAAAGGAAAGGAAGCCTTCTATTTAAATGATGGC 

CTTCTCCATATTTCTATAGTAATATGACTTCCCTTGCAAAGGGGGATGCA 

TTTTGGAAAATGTGTATAAATAAACTCAGGTGGTTTTGAATTTCATTTTC 

CTAACTGTAATTGTAATCATTGGTCTTTATGTTTAGTGAAAAAGTTTTGG 

CCCTTATGCCTCACACCTGAGAATCCCAAAGTATTGGTTTGTTAGAGCTC 

CCATAGAGAACCATAAACTGGGTGGCTTAAAACAACAGAAATGTATCGTC 

TCCTGGTTCAGGAGGCCAAAGTCTGAACTCCAGGTGTTGGTTCATTCTGA 

GAGCTCTGAGAGAGAATCTGTTCCAGGCTTCCCTTCAGTTTGTGGTAGCT 

CCAGGGTTCCTTGGCTGGTGGCAGCAAAACTCCAGTCTCTGCCCCCATCT 

TCACATGACTGTCTTCTCTCTGTGTTTCTGTGTCCAGATTGTCCTATAAG 

GACAGAGTCATACTGAATTAGGGCTCACTCGAATGACTTCATCTTAAGTT 

GAACTGTATCTGTAAAGACCTTATTTCCAAGTAAGGTCACATTCACAGCT 

ACTGGGGGATAGGACCTCAACATATCTTTTTGGGGGACATAATTCAACTC 

ATAATACCCAACATGATAACTGTTCATCCCATGAAATTTAATGTCTCTCA 

AAAGGTGATCTCAGGGCATTTAATCTGTGACAGAAACTCCCATAGGAAAC 

ATTCCAACCAGAAGCTCCTTTCACAGCTGGTCACTCCTCCTACCCCATCC 

GAGGTCCTGGGGCAGGGTGAGGCAGGTGGGGACAAGAAGAAGGCTGTCTC 

GGGTGTAGAAAGAGAAGACCCTTATTCACCCGGCACTCTGTTCATGAATG 

AGCTATCCAGCATAGGATATAATAAATCGCTTTAGGAGTGGTAGACTCCA 

AACATTTTTTTGGTCCCAGTTATCCTAATCAATTAAACAAACTCTAGAAC 

CCATCTTGAAGTGCAGGCATTGGGACATTATGAAACTTACACAGAATTCA 

AAAATTTACAAGGGCTAAATAAAACAGGGTCTGACATCTAATATTTTCTT 

CCCACATTCCCATGCACTGTCTGGCTCAACCATCCCCAACCCTCACTCTC 

ATCCTGGTGGACACATGCCTAGTGATGTGATCAGCTGGTTCACAGGGGGC 

TGGTGATGGTGGATATACAGCTTTTGCCAATTTCCATGGCATAACTACTC 

CAAATATGGCCAATTTCAAACTACCAACATGAAGGCACAGACACAGAGTT 

TGGAAGAGATGTTAGCAATTGGCTATTGCAAGCTGATATAAGCTAGCTCC 

AGCACAGCACCACCGCTACCTTTAAGCTCCTTGTGTTAGTGCAAGGGTTG 

GCAAACTGCAGCCTAAAGGCCAAATACAGCTTACTGCCTGTTTTGTGTAT 

TTGCCAACACAGCCATGCTGTCTATGGCCTTCTTTGTTCTGTAACAACAG 

AGCATGTGGCAGGATATATAGAATCTGGCAGTCTTTAATAAGTGCTGACT 

CCTGCTCTACAGGAGAACACAGATTGTCTTCAGCTTCCAAACATTCATCT 

CTGAGTCAGTCTAGAGCCTGAAATTTAGACTGAAGCACAGTTTCCACCAG 
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AGGGCTGCAATGCATGA'iAGTTGGGGTTTTCACCTCTCACCCZAAAAGCCT 

ACTCAATTTTTTACTGCAAAAACATGTTATCATCATTATTTTTTACTTAG 

CCCACCTTTCCTTGGCAATTTTCCATAGGAAAATGCATTCTAAATTTCAA 

CTAATCAGGGGACTTGGAGCCTCTGGACACCCCCTTGTTCCTTGCCCACA 

GTCCCTTGCAGAAGGTGCCTTATCAGAGCGGCTCCATGCAGGGGCTCAGG 

ACAGGATCAGATGTCAGTTGCACCAAGGGGGCAGGGACAGATCCTCTCTG 

CTSACCATGCAGAAGGGACTGTTCAGTGCACCGTCATGGTCCTGGTGATT 

TCTGGTCCATAAGGGAATTTTCACATGCATCGGGTGATTGTCACATCAGC 

ACAACACTGTGAGGAAGGCAGAGTGAGAATTTGTGTGCCCATTTTATAGG 

TGAGAAAACAGATGCAGAGACATTAAGTAACTTCACCACAGTCATGCGGG 

TTTTAAGTGGCAGACTTTCAGGTGTTGTGACTCCTAGTCCAGAGTTCTTT 

GCACTGCCCCTGAGGTGCTAAAACTCTACTGTGCTTTAAGACTCACTTGG 

GGAGCTTCCTAAAAAGAGAGATTGCACAACCTGAGATTCTTGTTTAACTG 

TTTTGGGATGTAGCTCAGGGATCTAGCTGCCTTAAAAAAAAAAACTCCCA 

AGTAATTCTGATGCAAGCGGTTCTTTTTTGTCCACCTTTGAAGAAACACT 

gcctcctccccatacatttcattagaaaatggtaacatgtttttcagcct 

gagagccatttctgggtgaccggacgtcggcagcccgctgtactagcttt 

cagtctaggcttaaacacacatgataggagatgtcctactccagatgata 

tgagtctgaaccatggaaaaattccattgtgtggcacatctggtgggtgt 

gcactgtccccagcagtgaggcacccagtgaagacagcagctgggagagg 

cttagttacatgcagtgggacagtgtgggctagactgctgagccctctgc 

agtttactctgtgtcaggcaatgagggtgaaaggctgatcagacccacgt 

gc ag a c cat ac c ct c c agggagacagat at cagt caggacaac c c c aagt 

gtagctggagaagcagtgcccaggtatgaccggatgtgtatccaaccagg 

aaatctgcatataaatataagaggagaaaatgaacagatgttgctcttat 

atgtagatatttatgaagagcatataattttgttttgtgtgttttaagaa 

gtttataagtatgccttaaaaatgtatagtatatactgtaggtatttttt 

ccattagatattttgtttttcatacttatccacattgacattgtagcaac 

agtataatataacaacctcctctacaaaagcagaaggaagtgaagctttg 

gaaggaagcacccagtgagcttgcccctttcaggtgggtgcagtgagcag 

gagtcagtgaggttgagatcctttgagaggaggcaatcattaaccaggaa 

atctgcactgcatcctggccacacctaacccttggacaatggtgcttgga 

gcgccttccagctcttaaggcttgcgatttctttctctcactcttcaccc 

acgatgattaaatcttctcctacagagttggacaataaagccttgagttc 

ctgcctcccctggtgtgatcacgaggcatagacatggccaggaacatgta 

ggtgtctttgaaagctgaacaagttagtaaatttcaaacctcatttcacc 

caccagtaaaatgggaataataataaacctattttacatagggttgacaa 

gaggagtaaagagggattcaatgaaagttcgttattatcatttgtagtag 

cagtgttgataatatcaactgaaagttcattatcattattagtagcagta 

ttgataaccctcttttctgtgccttctcactggtgggcccaggccatcag 

caatgcccagggtgtcatggatctctgctgcatcgggcaccagctgtgtc 

aatggtgagaacagtacaagggtgggcagggcaaggcaggaagcacccag 

gagcagcagcttcatggggtgaagatgtcaggagcttagggacagtcaga 

gcgggtgtgcctcctcttgtggagcctttctgcgtgggtaggaactgctg 

cagctgtggccatggattcacctgaatatgggtggaattaggcattcagc 

tgggttagctgtgcctagaaggaggaactctaaactgagaacttgtccct 

attgccacctctgataggcagatgatcca:tccatcagtggctgagctgag 

gtgtgcatggggatgggtaagagcccacacacagggctgatgactgagtc 

tatttagaacaatagatgtaaaatctgataatgtaaaatgtgatagatta 

ttttgtcaattagaaatggtaccatataattatatatatacataaacatg 

tatacatatacacacatatacatgtgtgtataaacacacacagtattgtc 

ccctactcattccataaacctgatgcctttagctgggattcccagctttc 

actctcctctctgtcatctgctgtctatatcctccccatcctgtaattct 

ggcttatatgccacttcctccctaaagccctccctcaatcccttgctgga 

agtgacattttcctctttgagctgcccctgcttgtgctttggtgaggtca 

gctgtattgcagtaccttgtattgtggttgtcacatcatcgtatagaatt 

aatt-ctgacacattccgtatttttcaaagggcctagtgtggggctttaa 

cagtaactacgccaccacgcccagttaattttttgtatttttggtggaga 

-aaggt^^accatgttggccgggctggtttcgaactcctgacttcaggt 

gatctgtctgcctcagcctcctggagtgctaggattgcaggcatgagcca 
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CTGCACCCAGCCACCTATCAAAATTTTAAGTGCCATTTTTATTTTTTATT 

^TTTGTAGAAATGGACAAGCTGATCGCAAAATTCACATGGAATTGCAGGA 

-rTTCCAAATAGCCAAAACAATCTTGAAAAAGAAGAACAAAGTTGGAGGA 

^TTACACTTTCCAGTTTCAAGACTTAGCTCTTAGCTACAAAGCTACAGTA 

aTCAGAACACTATGGTCCTGGCATAAGTGATGCTGGACAGGTGAGCCCCA 

AAGTGGGACTTAACCTGTGAAGGTTCTTGGCCTTGCCCAGGAAGGAATTC 

AAGGGCAAGCCAATGGGACAAGAAAACAGCTTTATTGAAGGGGCAGTATT 

ACAGCTCCAGCCCTGTTACAGCTCCAGCCCTGTTACAACTCTGACTACTC 

CTGCACAGAAGGGCTACCCTGTAGGCAGAGAGTAGCAACTCAGGGCAGTT 

TTGCAGTCATTTATATCCACTTTTAACACATGCAGATTAAGGGACAATTT 

ATGCAGAAATTTCTACGGAATTGGTAATAACTTTTGGGTCATGGAGTCAT 

CATGGAAGGGGGGCGGGGAACTCCCTGGTGTTGCCATGATGACGGTAAAC 

TGATATGGCGAACTGGTGGGTATGTCACATGAAAAGCTCCTTCCACCCCA 

GCCCTGTTTCAATTAGTCCTCGGTTTGGTCCAGTGTCCAAGTCCTGCCTC 

CAGAGTCAAGTCCCACCCCCTACCTCTTAAGGAGAGATGTAAATACATGG 

aATAGAATTGAGAGTCCAGAAATAATCTCATACATCTATGATCAATTGAT 

TTTCAGCAAAGGTGCCAAGACCATTCAATGAGGGAAAGAATCATATTTTT 

ScAACAAATGGTGCTGGATAACCACATGTGAAAGAATGCAACTGGGCCC 

Sa^t^accatatacagaaattaactcaaaatggctcaaacacttac 
Sgtaagagctaaaact^^^ 

^ATGACCTTGGATTTGCTGGCTGATTCTTAAATGACACTGAAAGCACAA 

gcScaaaagaaaaaaaaataggtaaattggacctcatcaaaatttaaaa 

CTTTTATGCTGGGTGCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGG 
raGGAGGATCTCTTGAGCCCAAGAAGCTGAGGCTACAGTGAGCCGAAATT 

S?aStaaacaaatatataattatagatctctggatcttgcc 

aCTGACTCAACTAACTGGTCTGGGTGGGAGCCCAGCCATTTGTATTTTTT 
ATCATAGGGTTGGACTCCTAACrGGAAACAGTTTGCACCATCAGGTGTCG 

Sg^t^gataatagttaagctttcctcctagattttctgatattaga 
^SStgtttacaagtttttaccaagagacaaactatctttctgccct 
t^tttctctcttatactattctaatcccagaaccctttggaacttccac 

ttcg^ctgtttgtgccagatccagttggtgctggtggtgggacag 
ccctgaaatcccctcctctcccgtcagattcagtcccccatttgcatcac 

aAGG^CAGGAGTTCGAGACCAGCCTGACCAACATAGTGAAACCCCGTCTC 

?a1^a£^tatS^ 

CAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAAGCGG 
TGAGACTACATCTCAAAAAAAAAAAAATGAGAGAGAGAGAGAGAAGCAGA 

aIcScaggtgtttcttttggcttaaagtactctgtga^ 

GACACGAAGGATACCATGAACTGAGAGATTTTGGGAACCTCTGCTTTAGA 
AGC^GGAGGTAGCATrcCCTTGGGCACAGTACT^ 

GGGATGTGCTTGGAGACAGAGGGAACTAGATTGAGCTGCCCGATAAAGAC 
A^CCAGCCTGG^GAGTGTAGTGACTCATGTCTGTAATCCTAGTGCTTT 

Sg^ctgaagt^ 

TGGGAAACAACAAGACCTCTACAAAAAAAAAAGAAAAAAAAAATTAACCA 
GAGGATCACTTGAGCCCAGGAAGGTAAGGATACATTGAGCCATGACTGTG 

cI^tgSStagcctggg^ 

aSaaa^taaataaatatatagtggccatgacatccctaga^gaca 

agg^tgggaataggtagaagccaagggaaatgagaaatgagagggg^ 

cctggagctggaactgggggagcaggatggcctctgagaagttcctgata 
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GTGGTGTCACTGATGTGTCTGATGTTTAGTTGTAATTATTTGCTGGGCCC 

CTGTCATCCCTCATATCTGATAGCTCTTTGCTAGTCAAAGTGTGGTCTGG 

GGATCAGCGGCATCAGCATCACTTGAGAACTTGTTAGAGATGCAGAATCT 

AGAGCCCCACCCGGGACCCAGAAACAGAGCCTGCATTTTAACAAGCTCCC 

CAGGTGATTCTCACACACACTCGCATTTGAGAAGCACTGGGCTAGTTGAC 

AGATTCTCAGGCATGGCTGACATTGAAATATCCAGGGAGCAGGCTTGGCA 

TTAGGATGTTTAAAAGTCCTCCAGGTGTTTCTAAAGCCAGGTTTGAGGAA 

TTACTGGGCTGATACAAATGTTTTGTGATGATGCTTTGTGTGTGTGTGTG 

TGTGTG7GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGGAATTC 

TGGGTCACTTGGCACCAACACAGGAAACAATGGAAATATGTGAGCCATGA 

CAGAAAGGTCAGGAGATAAAAGAAATTAGTGACATGAGAGGTACTCCTCA 

GGTGTTAGGAAAGAGGGTAGAGCAAACCAGGTTTTCCACCATATGTTGGA 

TAGGGGGTCAAGTAAATTTCTACTTAAAAATTACAAACAGGGGCTGGGCG 

CGGTGGCTCATGCCTGTAATCCCGCACTTTGGGAGGCTGAGGAGGGCGGA 

TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACACGGTGAAACCGTG 

TCTCCACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGCGTGCCTTTA 

TTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAG 

GTGGAGGTTGCAGTGGGCCGAGATCGCACCACTGCAATCCAGAGCGAGAC 

TGTGTCAAAAAAAAAAAAAAAAAGAAAATTCCAAACAGGATGACCCTAAG 

CCTGCAGGACTTGGAGACATCTAGGTGACTGATACTCAGTCACAAAACAT 

AATTGGTCACAGGCCTGATGAAATGCACAGCAGACCTTCAGATGGTATGC 

ACTCAAGTGATATCCACAAGTCCACCTAAAGAAATGCTATATTCAGACAT 

TTGGCATCAATCTCTATCAAACAAAGATAGTCCAAAGCAATGGGTTCCAA 

AAACACTTTCCTAAGACAAATTCTCTATTTGCTTTTAATATCAGTCATCC 

CAGCCCTTGGAATAGAGGAGCAAATGATACCAGTGGTACCCTACCACAAT 

GCACCAAGGTATTATACTCTCATGCTCCATTTTCTCCCTCTGTCTACATC 

ACTAATAACTCATTGATTTCTGGTGCAAGCCCTGCTGGGAGAAAAAGTCT 

ACTCTTGTACCTTGGAGCAAGTTGCTCAGAGTAGGTATCGAGGATAAAAT 

TTGGAAAGTTAGAAAAGCTATTAGAAGGAGATCCTAGTAGTTGAAAACAC 

AGCCTGGCCAAGTCAATGATGCTATTTCATCTCCCCAGCCTTGCATGTCC 

ATAGCTAAGGAAGACAATTTAGGCTTGGGCTAGAGGATGGGAAAGGGCAA 

AATTACTGATGCCACAGCCCAGAGAGGTATTCTAGTAATCTGAGGGTGAG 

GACCACATACCTGGTTCAGGGACGTACAGTGTTGACAGCTGTGAGTGGAT 

GCCTGGAGTTCTGGCGTGTCTTCTAGCACAATGATACCTGAGACTCTTGC 

ATCATTGGGAATAATAAAATGGGAGTGGATAGATATGAAATTATGATGGC 

AATAAGCAATCAGCTAATAGCTTCATTGATGGGACAGATTAAAGATGGCT 

GCAAATCCTTTGGTCCAGGTTTGGGATATAGGCAGCATTTGTATTGGAAT 

GCTGATAGTC7GAGGCCATGAAAAGTCCACCTGCAGTAGTGGTAGGAGGA 

ACAAGCCTCACTTTCTTCAATGTGTGTGACTGCTGTCTTGATTCCCTGGG 

TGGCCAGTTCCATTCGTGTGGTTCTTTGGTCCACTTGACTCTGGGGTGGC 

TCTGTGATGGCTTGACCAATACAATGTAGTGGAAATGATGCTGTCATCAT 

TTCCAGCCTCTTCCAGCCTTAAGGAACTGGCAACTTTTATTTCTGTCCCT 

TGGAATACTTGTTCTTGCAACCCATCCATCATACAGTGAGAAATTCTAAG 

CTGCCCCATTAAGAGGCCCACATGGTGATAAATTGGGGTCTTACATACAG 

CCCTAGCTGTGCTCCTAGCTGACAAACAGTAGCAACTTGTCACCAGGCGA 

GTGAACCACTTAGGACTGTATACTCCAGCCCCAGTTGAGCAATGTGGAAC 

AGAGTAAACCATCTCAGCTTAGCCCTGCCCAAACTGCAGAATTATGAGCA 

AAATAATCCCCTAGGCTTTGGGCTGATTTGTTCCAGATTACTGGAACAGA 

ATTTGGTACCAGGGGTGAGGTGCTACAGCAATGAAAGCTTAAGACACGTG 

ACTTTGGTTTTGGGTCTGAGTGGCAGGGGAACTTGGCAGGCCTCAAGGAA 

ACTTTTAGGGAGGGTTGAAGCATAGTGAGGAAAACAGTAGGGGAAGCTAG 

AGGAAAAAATGATGCTTGGTATGTAGTGGTGGGAAGTTTAGCAAAACTCG 

CCTGATGTAATGTGGGAAATTGTAAGAACTCAGAACGATTTAAGGGCATG 

TTTTATAGGTCCTTTAAGAAACTTCTAGGCCAGGCGCAGTGGCTCATGTC 

TGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTCAGG 

AGATCGAGACAATCCTGGCTAACATTGTGAAACCCCGTCTCTACTAAAAC 

TACAAAAAAAAATTAGCCGGGCATGGTGGCGGGTGCCTGTAGTCCCAGCT 

ACTAGGGAGGCTGAGGCAGAAGAATGGCGTGAACCTGGGATGTGGATCTT 

GAAGTGAGCCCAGATTGTGCCACTGCACTCCAGCCTGGGCAACAGAGTGA 

GACT C CGT C T CAAAC CGAAAAAAAAAAAAAAAAAAAGAAACTTCTAGGGC 
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TGGTCCCGTGGAAGCCTCACACATGGTACACAAAGGCTGTCTTGAAAAGA 

AACGTAAGTGTGTTTTTTGGTTTAATAAAATTGATTATAAATGGATAATG 

^AAAACATTTTAAAGAATTTTACTAGCTTACATTAGCAGATTTGGATCCA 

GTGATTGTTACATTCTGGTACTGAGCCCCTGAATTACTTCTTTGAGTAAG 

GCATTATACCAAAGCTATTGATAGTTGGGCTTATAGGGTGTATGTTTGAA 

GAACTACTAATGTCAAAACCAATATTTCACGGTCGACAAGAGGACATCAG 

AACTGGTAATCCTTATTACCATGACTGGCTGGACAGAATACTCAATGTAA 

TGGGATTTCCTGCAAATAAAGACGGGGAAGATGTAAAAAAGATGCCTGAA 

CATTCAACATTAATGAAAGATTTCAGAAGAAATATGTATACTAACTGCAG 

C CTTATCAAGTATATGGAAAAACACAAAGTTAAAC CAGATAGTAAAGCAT 

TCCACTTGCTTCAGAAGTTTCTTACTATGGACCCAATAAAGTGAATTACC 

TGAGAACGGGGTCCCTGTTTCTTCGAAGACCCACTTCCTACATCAGACGT 

TTTCAACAGTTGTCAAATCCCCTACCCAAAATGAGAATTTTTAACAGAAG 

AAGAACCTGATGACAAAGGAGCCAAAAAGAACCACCACCGGCAGCAGGGC 

CATAACCACACGAATGGAACTGGCCACCCAGGAATCAAGACAACGGTCAC 

ACACAGGGACCCCCGTTGAAGAAAGTGAGGCTTGTTCCTCCTACCACTAC 

CTCAGGTGGACTTTTCACGGCCTCAGACTATCCGCGTTCCAATCCACATG 

CTGCCTATATCCCAACCCTGGACCAAGCACATCCCAGCCGAAGAGCAGTG 

TAGGATACTCAGCTACCTCCCAGCAGGCTCCACAGGACCCACGTCAGACA 

CACGGGTACTGAGCTGCATCGGAATCTTGTCCGTGCACTGTTGTGAATGC 

TGCAGGGCTGACTGTGCAGCTCTCCGTGGGAACCTGGTATGGGCCATGAG 

AATGTACTGTACAACCACACCTGCCCAGTAGCCAAGTTCCTTCCACCGCT 

TTTCACAGATCGGGGTAGTGGCTTCCAGTTTGTACCTATTTTGGAGTTAG 

ACCTGAAAAGAAAGCGCTAGCACAGTTTGTGTTGTGGATTTGCTACTTTC 

ATAGTTAACTTGACCTGGCTCAGACTGACCAGTACTTTTTTTTCCGTGAC 

AGTCTATAGCAGTTGAAGCTGAGAATGTGCTAGGGGCAAGCGTTTGTCTT 

CATATGTCATGAATTCCTCCAGTGTAACAACATTATCTGACCAATAGTAC 

ACACACAGACACAAGGTTTAACTGGTACTTGAAAACATACAGTAGGTGTT 

AACT C AGTGAAATAAC CAGGACT CAAAGTAAGATTATTTTGGTACAC CTT 

TCTTGTTAGTGTCTTATCAGTGAGTTGATTCATTTTCTACATTAATCAGT 

GTTTTCTGACCAAGAATATTGCTTGGATTTTTCTGAAAGTACAAAAAGCC 

ACATAGTTTTTTTCAGAAAGGTTTCAAAACTCCTAAAGATTAATTTCCAA 

GTATAAGTTTGTTTTTATTTTCAATCTATGACTTGACTGGTATTAAAGCT 

GCTATTTGATAGTAATTAGATATATTCTCATTGATATAAACCTGTTTGGT 

TCAGCAAACAAACTAAAATGATTGTCACAGACAATGCTTTATTTTTCCTG 

TTGGTGTTGCTTGTGGGAAAAAGAAAGAGAGATCAGATTGTTACTGTGTC 

TGTGTAGAAAGAAGTAGACATAGGAGACTCCATTTTGTTCTGTACTAAGA 

AAAATTC^TCTGCCTTGAGATGCTGTTAATCTATATAACCTTACCCCCAA 

CCCTGTGCTCTCTGAAACATGTGCTGTGTCCACTCAGGGTTAAATGGATT 

AAGGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATG 

CT C GT AAGAGT CATCAC CACT C C CT AAT CT CAAGTAC C CAGGGACACAAA 

CACTGCTGAAGGCCGCAGGGACCTCTGCCTAGGAAAGCCAGGTATTGTCC 

AAGGTTTCTCCCCATGTGATAGTCTGAAATATGGCCTCGTGGGAGGGGAA 

AGACCTGACCGTCCCCCAGCCCGACACCCGTAAAGGGTCTGTGCTGAGGA 

GGATTAGTATACGAGGAAGGAACGCCTCTTTGCAGTTGAGACAAGAGGAA 

GGCATCTGTCTTCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAA 

CCCGATTTTATGTTCCATCTACTGAGATAGGGGAAAACCACCTTAGGGCT 

GGAGGTGGGACATGCGGCAGCAATACTGCTCTTTAAGACATTGAGATGTT 

TATGTGTATGCATATCTAAAGCACAGCACTTAATTCTTTACCTTGTCTAT 

GTTGCAGAGACCTTTGTTCACGTGTTTATCTGCTGACCTTCTCTCCACTA 

TTATCCTATGACCCTGCCACATCCCCCTCTCCGAGAAACACCCAAGAATG 

ATCAATAAATACTAAGGGAACTCAGAGGCCGGCGGGATCCTCCATATACT 

GAACGCTTGTCCCCTGGGCCCCCTTATTTCTTTCTCTATACTTGGTCTCT 

GTGTCTTTTTCTTTTCCAAGTCTCTCGTTCCACCTAATGAGAAACACCCA 

CAGGTGTAAAGGGGCAACCCACCCCTTCATTGCTGATTTGTGAGCGTGCT 

TTAAGGTGAAAAAAGCATGAATGTTAACTTCCTTAAAAAGGTACAGCATC 

CAATTCAAATATTTTTGTCCTGATTTTAATGCTAGTTGATGTAGTGCTAT 

TAAAATTTTGTTCAACATGGACACAGAGAGGGGAACAACACATACCAGGG 

C^TGTTGCGGGGTGGGGATGAGGGGAGGGAACTTAGAGGACAGGTGAACA 

GGTGCAGCAGATCACCATGGCCCACATATACCTATTTAACAAACCTGCAC 
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GTTCTGCACACGTATCCCA.TTTCTTTTTTTTTTTAAGAAATAGAAAAAAA 
AATAAAATTTTGTTCACTGATTCTTCCATTTTAAAACTTGTTTGCATGTG 
GTTTAGGATGCCCTTACTTCAGCAAAGGAGAAGGAATAGGAGGGCCTTAG 
AATTTTTGAGGGAAAAAAACCCTATAACATACATTGTACTGTATCAAACT 
ATTTTACATGAATGACACAAGTATTCTGAATAAAAAAATAATTGAACATT 
GTTAAGAACAAGGTGTCATGTAATTTATTTTTCATAAATAAAAAAATTAT 
AGTGGCTTAGACTGAAAGGAACAGAGAATTTAAAAAATTAAAAAGAAGCC 
TTAGTATATTTTTGTATATAGTTTCCATGTGCCATATTTGCCATAATTGG 
ATGAGAATTTTTTGACCTCTGGCAGGGTGACCCTATATTTTCANTNTATA 
AAGCGTGCATCATACC 



BNSDOCID: <WO 9S06426A1 J_; 



FIG. A (61 of 61) 

H5 ja^ 



WO 99/06426 



PCT/US98/16102 



CO ^ 

s° < 



51 

= a 
J < 

= > 

^ cn 



< 



3: 

5-2 
= Z5 

- S 

35 g 

< Q 5 
r * cc 

Z 9 * 
> > < 

5r — ^ 

^« 

U Ul "! 

O c — 
55 & e 
Q co ^ 

C > > 
S X 

lis 

* w c 

ill 



aJSDOCID: <WO G906426A1J_> 



WO 99/06426 



PCT/US98/16102 




WO 99/06426 



PCT/US98/16102 




//?//// 



iSDOCID: <WO 990642&A1J_> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/ 16102 



| A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :C07H 21/02, 21/04, 1/00, 14/00, 17/00; C12Q 1/68; G01N 33/53 
US CL : 536/23.1; 530/350, 387.1; 435/6, 7.1 
According to International Patent Classification (IPC) or to both national classification and IPC 



FIELDS SEARCHED 



| Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536723.1; 530/350, 387.1; 435/6, 7.1 

I Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



| Electronic data base consulted during the international search (name of date base and, where practicable, search terms used) 
DIALOG: MEDLINE, USPATFUL, WPI, BIOSIS. Search terms include author, "TANGO" and protein 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Database Medline on Dialog, US National Library of Medicine, 
(Bethesda, MD, USA) AN 09370320. SONNENFELD et al. , The 
Drosophila tango gene encodes a bHLH-PAS protein that is 
orthologous to mammalian Arnt and controls CNS midline and 
tracheal development'. Development. November 1997, volume 124, 
number 22, pages 4571-82, Abstract. 



1-22 



| | Further documents are listed in the continuation of Box C. Q See patent family annex 



Special categories of cited documents: 

I mj^m document defining the general stale of the art which is not considered 

to be of particular relevance 

■g» mir— document published on or after the mternattonal flling date 

"L" document which may throw doubts on priority claim (•) or which is 

cited to eetablish the publication date of another citation or other 
special reason (as specified) 

•O* document referring to an oral disclosure, use, exhibition or other 

"P* document published prior to the international filing date but later than 
the priority date claimed 



later document publisbed after the international Tiling date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the inventioo 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 
21 OCTOBER 1998 



Date of mailing of the international search report 



30 0CT1998 



Name and mailing address of the ISA/US 
| Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsinule No. (703) 305-3230 



Authorized officer 

HEATHER BAKAJLYAR 
Telephone No. (703) 308-0196 




E^ISDOCID: <WO. 



,<&%™m& s ,Y? 10 (second * heet >< Ju, y 1992) * 



